1 CCGCAACCCC GACGGCGCCC CAAACGCTGT TGCGCCGCGC GCCCCGCCCA
51 GCCCGGCCTC GCGCTGGTCC CGGTCTCGCC CCGCAGCCCT CGATCTCCCG
101 TGACTTCCTC GGCCAGGCCG CCTGCGCCTC TGGGACCATG TTGCGCTGGC
151 TGCGGGACTT CGCGCTGCCC ACCGGGGCCT GCCAGGACGC GGAGCAGCCG
201 ACGCGCTACG AGACCCTCTT CCAGGCACTG GACCGCAATG GGGACGGAGT
251 GGTGGACATC GGCGAGCTGC AGGAGGGGCT CAGGAACCTG GGCATCCCTC
301 TGGGCCAGGA CGCCGAGGAG AAAATTTTTA CTACTGGAGA TGTCAACAAA
351 GATGGGAAGC TGGATTTTGA AGAATTTATG AAGTACCTTA AAGACCATGA
401 GAAGAAAATG AAATTGGCAT TTAAGAGTTT AGACAAAAAT AATGATGGAA
451 AAATTGAGGC TTCAGAAATT GTCCAGTCTC TCCAGACACT GGGTCTGACT
501 ATTTCTGAAC AACAAGCAGA GTTGATTCTT CAAAGCATTG ATGTTGATGG 
551 GACAATGACA GTGGACTGGA ATGAATGGAG AGACTACTTC TTATTTAATC 
601 CTGTTACAGA CATTGAGGAA ATTATCCGTT TCTGGAAACA TTCTACAGGA 
651 ATTGACATAG GGGATAGCTT AACTATTCCA GATGAATTCA CGGAAGACGA 
701 AAAAAAATCC GGACAATGGT GGAGGCAGCT TTTGGCAGGA GGCATTGCTG 
751 GTGCTGTCTC TCGAACAAGC ACTGCCCCTT TGGACCGTCT GAAAATCATG 
801 ATGCAGGTTC ACGGTTCAAA ATCAGACAAA ATGAACATAT TTGGTGGCTT 
851 TCGACAGATG GTAAAAGAAG GAGGTATCCG CTCGCTTTGG AGGGGAAATG 
901 GTACAAACGT CATCAAAATT GCTCCTGAGA CAGCTGTTAA ATTCTGGGCA 
951 TATGAACAGT ACAAGAAGTT ACTTACTGAA GAAGGACAAA AAATAGGAAC 

1001 ATTTGAGAGA TTTATTTCTG GTTCCATGGC TGGAGCAACT GCACAGACTT 

1051 TTATATATCC AATGGAGGTT ATGAAAACCA GGCTGGCTGT AGGCAAAACT 

1101 GGGCAGTACT CTGGAATATA TGATTGTGCC AAGAAGATTT TGAAACATGA 

1151 AGGCTTGGGA GCTTTTTACA AAGGCTATGT TCCCAATTTA TTAGGTATCA

1201 TACCTTATGC AGGCATAGAT CTTGCTGTGT ATGAGCTCTT GAAGTCCTAT 

1251 TGGCTGGATA ATTTTGCAAA AGATTCTGTA AACCCTGGAG TCATGGTGTT 

1301 GCTGGGATGC GGTGCCTTAT CCAGCACCTG TGGTCAGCTG GCCAGCTACC 

1351 CATTGGCTTT GGTGAGAACT CGCATGCAGG CTCAAGCCAT GTTAGAAGGT 

1401 TCCCCACAGC TGAATATGGT TGGCCTCTTT CGACGAATTA TTTCCAAAGA 

1451 AGGAATACCA GGACTTTACA GAGGCATCAC CCCAAACTTC ATGAAGGTGC 

1501 TCCCTGCTGT AGGCATCAGT TATGTGGTTT ATGAAAATAT GAAGCAAACT 

1551 TTAGGAGTAA CCCAGAAATG ATGTTGCATT TTTTGCTTTA GCCTGATAAT 

1601 TGAAACTTTC AACAATCTCT GGAGTGACTT TTTCTCCTCG AATTGAAACA 

1651 AGTCTATGGC AAAAGAAGCT GCATTTTTTT CACAAAAGGG AAGACGGTAA

1701 CAATGGTCAC TTCAAACTTT TGGGCTAAAT TATATGTACA CAGAAATGTT 

1751 CAAAATCATA GTTTTAATGT GTTTTGAAAA GGCCACACAA TTATACTTTA 

1801 TCTTTTCTTA ATAATCCTGC AAATCTCTGC CCTGAATCCG AAATCTGAAA 

1851 ATGTACTGGC TTGAACAAAA TTTGTTTTGT GTGTTAGAGT TATAAATCAT 

1901 TAATCTTTAT TTCGGGTGGT TTACGTTTAT GCCAGTTCCT TTATATTTAA 

1951 ATTTCTTGTT TTATATATTT TGAATGTCTT TATAGATTTC TTTAAATTTC 

2001 CTTATAGAAC CATTAATAGA AAATCATTAC ATTTAAAATA TACCTTACAG 

2051 CAAAAGCATC CAAATAAGTA TAGGGTTTAT GTCCTTATTT TTCTTTCAGC 

2101 TGAATACGAA TGAACACAGT GGTGGAATTT CTGAAGGGAA GTGATGAAAT 

2151 TATATTTATT TCAGTGGGCA CTTTTCCATT TTACCACTGT ACCATTATTT 

2201 GGTTCCTGGA GTTATACACT AATTTTCAGT ATATTACTGT TAAATTACCA 

2251 ACACAAGGCA ATTTATTTGA AAGATTCCGT TTATCCTGCC ATTGCTTTGA 

2301 AAAGCAGCAG GAAACGAAAT TTTTTGACTT GTATCAGCTT CTGCAGAGCA

2351 TCTTTGTTTT CCTTTGTCCT TTGTTTCCTA CCTTTTGAAT CAGATTCCGT

2401 TTTAGTCAGG AAGACTTCTT GGGACCATTC TTAGTAACCT GAAATTTCTT 

2451 TTTTAATTGC ATGAAGTGGA TTGATCATGA GCAAGTGATG GGCTTTATTT 

2501 CTCCCTCACT GGTGAATATC CTTTGAACTT GCTGTTTGCA ATATGGGCAG 

2551 CCACAAAGGG GGAGAGATGC CTATTAAATC GGCGGGGTGT ATGACTTCTG 

2601 AAAACATTGG ATACCCTATT TTGAAAAGGG AAAGGCCCAA TTTGGGGAAA 

2651 CATATACCAA TGCATGATTT CTG (SEQ ID NO: 1)
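
The listing above uses rows of five 10-base blocks, each row prefixed with the 1-based position of its first base. A minimal sketch of how such rows could be joined back into one continuous string is given below; the function name `join_blocks` and the continuity check are illustrative assumptions, not part of the original listing. The sample rows are positions 501-650 copied from the listing.

```python
def join_blocks(lines):
    """Join numbered 10-base blocks into one string.

    Each row is expected to look like '501 ATTTCTGAAC AACAAGCAGA ...',
    where the leading integer is the 1-based position of the first base.
    Returns (start_position, sequence).
    """
    start = None
    sequence = ""
    for line in lines:
        fields = line.split()
        if not fields:
            continue  # skip blank lines between rows
        position = int(fields[0])
        if start is None:
            start = position
        elif position != start + len(sequence):
            # A mismatch usually means a dropped or damaged block.
            raise ValueError(f"row {position} does not follow the previous row")
        sequence += "".join(fields[1:])
    return start, sequence


ROWS = [
    "501 ATTTCTGAAC AACAAGCAGA GTTGATTCTT CAAAGCATTG ATGTTGATGG",
    "551 GACAATGACA GTGGACTGGA ATGAATGGAG AGACTACTTC TTATTTAATC",
    "",
    "601 CTGTTACAGA CATTGAGGAA ATTATCCGTT TCTGGAAACA TTCTACAGGA",
]

start, sequence = join_blocks(ROWS)
print(start, len(sequence))
```

The continuity check is a cheap way to spot rows where a block has been lost or altered in transcription, since the next row's position label would then no longer match.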
FEATURES:

5'UTR: 1-137 

Start Codon: 138 

Stop Codon: 1569 

3'UTR: 1572 
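
The feature coordinates can be cross-checked against the protein length: the distance from the first base of the start codon to the first base of the stop codon should be a multiple of three, and the codon count should equal the number of residues in SEQ ID NO: 2. A short sketch of that arithmetic follows; the function name `coding_length` is an illustrative assumption.

```python
# Feature coordinates as listed above (1-based positions in SEQ ID NO: 1).
START_CODON = 138   # first base of the ATG
STOP_CODON = 1569   # first base of the stop codon


def coding_length(start, stop):
    """Return (coding bases, codons) for an ORF that begins at `start`
    and ends immediately before the stop codon at `stop`."""
    bases = stop - start
    if bases % 3 != 0:
        raise ValueError("start and stop codons are not in the same frame")
    return bases, bases // 3


bases, codons = coding_length(START_CODON, STOP_CODON)
print(bases, codons)  # 1431 477
```

The result, 477 codons, agrees with the 477-residue protein listed as SEQ ID NO: 2, and the stop codon occupying 1569-1571 is consistent with the 3'UTR beginning at 1572.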

HOMOLOGOUS PROTEINS: 
TOP BLAST Hits: 

CRA|335001098641184 /altid=gi|11360341 /def=pir||T50686 peroxis.
CRA|11000479457833 /altid=gi|6841066 /def=gb|AAF28888.1|AF12330.
CRA|18000005183605 /altid=gi|7504235 /def=pir||T22688 hypotheti.
CRA|1000682325160 /altid=gi|7499323 /def=pir||T21074 hypothetic.
CRA|89000000196990 /altid=gi|7294582 /def=gb|AAF49922.1| (AE003.
CRA|150000075553401 /altid=gi|9758252 /def=dbj|BAB08751.1| (AB0.
CRA|335001098657884 /altid=gi|11358611 /def=pir||T49871 peroxis.
CRA|163000046661776 /altid=gi|10176874 /def=dbj|BAB10081.1| (AB.
CRA|105000014652720 /altid=gi|10798831 /def=dbj|BAB16462.1| (AP.
CRA|335001098655048 /altid=gi|11277065 /def=pir||T47703 Ca-depe.

BLAST dbEST hits:

gi|10145202 /dataset=dbest /taxon=96..
gi|1437155 /dataset=dbest /taxon=9606
gi|10333851 /dataset=dbest /taxon=96..
gi|8469752 /dataset=dbest /taxon=960..
gi|11684041 /dataset=dbest /taxon=96..



EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 

Expression information from BLAST dbEST hits: 

gi|10145202 Placenta Choriocarcinoma

gi|1437155 Retina

gi|10333851 uterus leiomyosarcoma

gi|8469752 Breast

gi|11684041 Ovary fibrotheoma

Expression information from PCR-based tissue screening panels:
Leukocyte 
1 MLRWLRDFAL PTAACQDAEQ PTRYETLFQA LDRNGDGVVD IGELQEGLRN

51 LGIPLGQDAE EKIFTTGDVN KDGKLDFEEF MKYLKDHEKK MKLAFKSLDK

101 NNDGKIEASE IVQSLQTLGL TISEQQAELI LQSIDVDGTM TVDWNEWRDY

151 FLFNPVTDIE EIIRFWKHST GIDIGDSLTI PDEFTEDEKK SGQWWRQLLA

201 GGIAGAVSRT STAPLDRLKI MMQVHGSKSD KMNIFGGFRQ MVKEGGIRSL

251 WRGNGTNVIK IAPETAVKFW AYEQYKKLLT EEGQKIGTFE RFISGSMAGA

301 TAQTFIYPME VMKTRLAVGK TGQYSGIYDC AKKILKHEGL GAFYKGYVPN

351 LLGIIPYAGI DLAVYELLKS YWLDNFAKDS VNPGVMVLLG CGALSSTCGQ

401 LASYPLALVR TRMQAQAMLE GSPQLNMVGL FRRIISKEGI PGLYRGITPN
451 FMKVLPAVGI SYVVYENMKQ TLGVTQK (SEQ ID NO: 2)



FEATURES: 

Functional domains and key regions: 

[1] PDOC00001 PS00001 ASN_GLYCOSYLATION
N-glycosylation site

254-257 NGTN (SEQ ID NO: 7)

[2] PDOC00005 PS00005 PKC_PHOSPHO_SITE
Protein kinase C phosphorylation site

Number of matches: 2 

1 229-231 SDK 

2 475-477 TQK 

[3] PDOC00006 PS00006 CK2_PHOSPHO_SITE
Casein kinase II phosphorylation site

Number of matches: 8

1 22-25 TRYE (SEQ ID NO: 8) 

2 65-68 TTGD (SEQ ID NO: 9) 

3 121-124 TISE (SEQ ID NO: 10) 

4 157-160 TDIE (SEQ ID NO: 11) 

5 170-173 TGID (SEQ ID NO: 12) 

6 179-182 TIPD (SEQ ID NO: 13)

7 185-188 TEDE (SEQ ID NO: 14) 

8 227-230 SKSD (SEQ ID NO: 15) 

[4] PDOC00008 PS00008 MYRISTYL 

N-myristoylation site

Number of matches: 16 

1 52-57 GIPLGQ (SEQ ID NO: 16)

2 119-124 GLTISE (SEQ ID NO: 17)

3 171-176 GIDIGD (SEQ ID NO: 18)

4 201-206 GGIAGA (SEQ ID NO: 19)

5 202-207 GIAGAV (SEQ ID NO: 20)

6 245-250 GGIRSL (SEQ ID NO: 21)

7 253-258 GNGTNV (SEQ ID NO: 22)

8 283-288 GQKIGT (SEQ ID NO: 23)

9 295-300 GSMAGA (SEQ ID NO: 24)

10 322-327 GQYSGI (SEQ ID NO: 25)

11 326-331 GIYDCA (SEQ ID NO: 26)

12 359-364 GIDLAV (SEQ ID NO: 27)

13 392-397 GALSST (SEQ ID NO: 28)

14 399-404 GQLASY (SEQ ID NO: 29)
15 442-447 GLYRGI (SEQ ID NO: 30)

16 446-451 GITPNF (SEQ ID NO: 31)

[5] PDOC00018 PS00018 EF_HAND
EF-hand calcium-binding domain

Number of matches: 3

1 32-44 DRNGDGVVDIGEL (SEQ ID NO: 32)

2 68-80 DVNKDGKLDFEEF (SEQ ID NO: 33) 

3 99-111 DKNNDGKIEASEI (SEQ ID NO: 34) 
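
Short site motifs of the kind listed above can be located with regular expressions. The sketch below is illustrative only: the three expressions are the standard PROSITE patterns for PS00001, PS00005 and PS00006 rewritten by hand as Python regular expressions, the function name `scan` is an assumption, and the test fragment is residues 225-260 of SEQ ID NO: 2 as listed. It is not the tool that produced the table above.

```python
import re

# PROSITE patterns rewritten as regular expressions (hand-translated).
PATTERNS = {
    "ASN_GLYCOSYLATION": r"N[^P][ST][^P]",  # N-{P}-[ST]-{P}
    "PKC_PHOSPHO_SITE": r"[ST].[RK]",       # [ST]-x-[RK]
    "CK2_PHOSPHO_SITE": r"[ST].{2}[DE]",    # [ST]-x(2)-[DE]
}


def scan(seq, offset=1):
    """Return (name, begin, end, match) for every pattern hit in `seq`.

    `offset` is the 1-based position of seq[0] in the full protein.
    A lookahead is used so that overlapping sites are all reported.
    """
    hits = []
    for name, pattern in PATTERNS.items():
        for m in re.finditer(f"(?=({pattern}))", seq):
            begin = m.start() + offset
            end = begin + len(m.group(1)) - 1
            hits.append((name, begin, end, m.group(1)))
    return hits


# Residues 225-260 of SEQ ID NO: 2.
FRAGMENT = "HGSKSDKMNIFGGFRQMVKEGGIRSLWRGNGTNVIK"

for hit in scan(FRAGMENT, offset=225):
    print(hit)
```

On this fragment the scan reports NGTN at 254-257, SDK at 229-231 and SKSD at 227-230, matching the corresponding entries in the feature list.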

Membrane spanning structure and domains: 
Helix Begin End Score Certainty

1 292 312 1.053 Certain 

2 345 365 0.613 Putative 

3 381 401 1.544 Certain 

4 446 466 0.733 Putative 

BLAST Alignment to Top Hit: 

>CRA|335001098641184 /altid=gi|11360341 /def=pir||T50686 peroxisomal
Ca-dependent solute carrier [imported] - rabbit
/org=rabbit /taxon=9986 /dataset=nraa /length=475
Length = 475

Score = 927 bits (2371), Expect = 0.0 

Identities = 454/477 (95%), Positives = 466/477 (97%), Gaps = 2/477 (0%) 
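
The percentages on the statistics line follow directly from the counts. The sketch below parses such a line and recomputes them; the function name `parse_stats` is an illustrative assumption, and integer truncation is used because it reproduces the printed values on this line (466/477 is about 97.7% but is printed as 97%).

```python
import re

LINE = "Identities = 454/477 (95%), Positives = 466/477 (97%), Gaps = 2/477 (0%)"


def parse_stats(line):
    """Map each statistic name to (count, alignment length, percent).

    The percent is truncated to an integer, which matches the values
    printed on the line above.
    """
    stats = {}
    for name, count, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        count, length = int(count), int(length)
        stats[name] = (count, length, 100 * count // length)
    return stats


for name, values in parse_stats(LINE).items():
    print(name, values)
```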

Query: 1 MLRWLRDFALPTAACQDAEQPTRYETLFQALDRNGDGVVDIGELQEGLRNLGIPLGQDAE 60

MLRWLR F LPTAACQ AE PTRYETLFQALDRNGDGWDI ELQEGL++LGIPLGQDAE 
Sbjct: 1 MLra^RGFVLPTMCQGAEPFTRYETLFOj\m^ 60 

Query: 61 EKIFTTGDVNKDGKLDFEEFMKYLKDHEKKMKLAFKSLDKNNDGKIEASEIVQSLQTLGL 120

EKIFTTGDVNKLXjKUDFEERVIKYLKDHE^ 
Sbjct: 61 EiaFTTGDVNKDGKUDFEEmKYLKXIHEKKMK 120 

Query: 121 TISEQQAELILQSIDVDGTMTVDWNEWRDYFLFNPVTDIEEIIRFWKHSTGIDIGDSLTI 180

TESEQQAELILQSID DGTMTVCWNEWRDYFLFNPV DIEEIIRFWKHSTGIDIGDSLTI 
Sbjct: 121 TISEQO^LILQSroADGTMTVDWNEWRD 180 

Query: 181 PDEFTEDEKKSGQWWRQLLAGGIAGAVSRTSTAPLDRLKIMMQVHGSKSDKMNIFGGFRQ 240

PDEFTE+E+KSGQWWRQLLAGGIAGAV5RTCT MNIFGGFRQ 
Sbjct: 181 PDEFTT£EERKSGQM/\/RQLLAGGIAGAVSRT5T^ — MNIFGGFRQ 238 

Query: 241 MVKEGGIRSLWRGNGTNVIKIAPETAVKFWAYEQYKKLLTEEGQKIGTFERFISGSMAGA 300

M+KEGGHISLWRGNGTT^KIAPETAVKFW YEQYKKLLTEEGQKIGTFERFISGSMAGA 
Sbjct: 239 MIKEGGVRSLV\RGNGTNVIIGAPETAWFIWYE^ 298 

Query: 301 TAQTFIYPMEVMKTRLAVGKTGQYSGIYDCAKKILKHEGLGAFYKGYVPNLLGIIPYAGI 360

TAOJFTyTMEVMl<TRIJ\VGKTGQYSGIYDCAKKILK+ GAFYKGYVPNLLGIIPYAGI 
Sbjct: 299 TAOTFIYPMEVMKTRUWGKTGQYSGIYLXIAKK^ 358 

Query: 361 DLAVYELLKSYWLDNFAKDSVNPGVMVLLGCGALSSTCGQLASYPLALVRTRMQAQAMLE 420

DLAVYELLKS^UDNFAHDSVNF^aV+VLLGCGAI^STCGQ 
Sbjct: 359 DLAWELLKSHWUDNFAKDSVNPGVLVLLGCGALSSTC^ 418 
Query: 421 GSPQLNMVGLFRRIISKEGIPGLYRGITPNFMKVLPAVGISYVVYENMKQTLGVTQK 477 (residues 1-477 of SEQ ID NO: 2)

G+PQUvlWGLFRRIISKEG+PGLYRGTTPNFMKVLPAVGK^ ^ Ctf 

Sbjct: 419 GAPQUVMVGLFRRIISKEGLPGLYRGITPNFMKVLPAVGI^^ 475 m ^ 
(SEQ ID NO: 4)

>CRA|11000479457833 /altid=gi|6841066 /def=gb|AAF28888.1|AF123303_1
(AF123303) calcium-binding transporter [Homo sapiens]
/org=Homo sapiens /taxon=9606 /dataset=nraa /length=411
Length = 411

Score = 834 bits (2132), Expect = 0.0 
Identities = 409/410 (99%), Positives = 409/410 (99%) 

Query: 8 FALPTAACQDAEQPTRYETLFQALDRNGDGVVDIGELQEGLRNLGIPLGQDAEEKIFTTG 67

F LPTMCXPAEQPTRYETLFO^UDRNGDGVVDIGELQEGLRNLGIPLGOJDAEEKIFTTG 
Sbjct: 1 FVLFTMOQDAEQPTRYETLFQAIJDRNGDGVvDIGELQEGLfWLGIPLQ 60 

Query: 68 DVNKDGKLDFEEFMKYLKDHEKKMKLAFKSLDKNNDGKIEASEIVQSLQTLGLTISEQQA 127

DVNKDGKLDFEEFMKYLKDHEKI^^ 
Sbjct: 61 DVNKDGKLDFEEFMC^KDHEKKNIKUTO 120 

Query: 128 ELILQSIDVDGTMTVDWNEWRDYFLFNPVTDIEEIIRFWKHSTGIDIGDSLTIPDEFTED 187

EIJELQSTDVDGTMTVC^EWRDYFL 
Sbjct: 121 ELJLO^ILA/DGTMTVDa/NEWRDYFLFN 180 

Query: 188 EKKSGQWWRQLLAGGIAGAVSRTSTAPLDRLKIMMQVHGSKSDKMNIFGGFRQMVKEGGI 247

EKKSGQWWRQLU\GG:D\GAVSRTS^ 
Sbjct: 181 EKKSGQ^QLU\GGIAGAVSRTSTAPmRLiaiv^ 240 

Query: 248 RSLWRGNGTNVIKIAPETAVKFWAYEQYKKLLTEEGQKIGTFERFISGSMAGATAQTFIY 307

RSLVyRGNGTNVIKIAPETAWR/MY 
Sbjct: 241 RSLV\RGNGTNVIKIAPETAVKFta^^ 300 

Query: 308 PMEVMKTRLAVGKTGQYSGIYDCAKKILKHEGLGAFYKGYVPNLLGIIPYAGIDLAVYEL 367

PMEVMKTRI^VGKTGQYSGIYDCAKJQLK^ 
Sbjct: 301 PMEVMORUWGiaGQYSG:^^ 360 

Query: 368 LKSYWLDNFAKDSVNPGVMVLLGCGALSSTCGQLASYPLALVRTRMQAQA 417 (residues 8-417 of SEQ ID NO: 2)

LKSYWLIDNFAKDSVNPGVIWLL^^ 
Sbjct: 361 LKSYWUJNFAKDSVNFKMWLLGCG^ 410 
(SEQ ID NO: 5) 

Score = 80.0 bits (194), Expect = 6e-14 

Identities = 80/388 (20%), Positives = 156/388 (39%), Gaps = 59/388 (15%) 

Query: 95 FKSLJDKNNDGKIEASEIVQSLQTLGLTTSEQQAELJCLQSIDV — LXJIMTVDWNEWRDYFL 152 

F++LD+N DG ++ E+ + L+ LG+ + + E I + DV DG + 
Sbjct: 21 FQALI)RNGDGWDIGELQEGLJWLGIPLGQDAEEKIFTTGDvNI<re 68 

Query: 153 mP\m)IEEIIRFW1STGIDIGDSLTIPDEFTEDEKXSGQVM^QLIJ\GGIAGAVSR^ 212 

DEE+++K +EKK++L + 

Sbjct: 69 DFEEFMKYLK DHEKK^LAFKSUDKNISIDGKIEASEIV 105 

Query: 213 APLDRLKINMQVHGSKSDKWIF^ 272 

LL + + ++ +1 V R + N I E -H-FW + 
Sbjct: 106 O^LOJLGLTTSEQQAELILQSIDN^XJTMR^JEVMJYFLmPVroi EEIIRFWKH 161 

273 EQYKKL LTEEGQKICTFER-FISGSmGATAQTFIYPMEVMKTRLAV-GKT 3215? 

+ TE+ +K G + R +-Kj +AGA ++T P++ +K + V G ~3 

162 STGIDIGDSLTTPDEFTEDEKKSQQWWRQLU^GGIAGAVSRT^ 221^5 

322 GQYSGIYDCAKKILKHEGLGAFYKGWW 38@ 
1+ ++++K G+ + -HG Mf+ I P + YEK ++ K§ 

222 SDKMNIR3Gn*QWKE«nRSLW^^ LTEEGQ 27 



382 NFGVMVLLGGGALSSTCGQLASYPUkLVR^^ 441 
G G+++ Q YP+ +++TR+ A+ + + ++I+ EG+ 

Sbjct: 278 laGTFERRESGSMAGATAQTFIYPMEVMKTRL AVGKTGQYSGIYDCAKKILKHEGLG 334 

Query: 442 GLYRGITPNFMKVLPAVGISYVVYENMK 469 (residues 95-469 of SEQ ID NO: 2)

Y+G PN + ++P GI VYE +K 
Sbjct: 335 AFYKGYVPNLLGIIPYAGIDLAVYELLK 362 (SEQ ID NO: 6) 
HMMER search results (Pfam):
Model Description Score E-value N

PF00153 Mitochondrial carrier proteins

PF00036 EF hand

PF00404 Dockerin domain type I

PF01978 Protein of unknown function

Parsed for domains: 
1 AACCCATGTT AGTGTGCAGT TCTGCTGGCA CACACATGCA GTTGTGTAAC
51 CACTACCACC AAAAGCAAGA TGTAAAATAG CTCCATCACC CCCACAAGCC 
101 TTCTGATGCT CTTTTGTCAT CAATTCCCTT CCCGCTAGTC ACAACTGGTA 
151 ACTACTGATT TGTTTTCTGT CCCTATAGTT TTGCCTTTTC CAGAATGTCA 
201 TTGTTGACAG GTATCAGTAA TTCATTCCTT TTTATTGCTA ATTACTATCT 
251 CACTGTATGA ATGCAACACA GGTTGTTTAC CAGTTCACCC GTTAAAGAAC 
301 ATTTTGTTTC TGCGCTTGAC AGTTATGAAT AGAACTGCTA TAAACCCTCA 
351 AGTAAAAGTT TTGGTGTGAA GATAATTTTC TCAGCAAAAA CGCTGACAGG 
401 TAATTTTTCT AAGTATTACT TTTTTAAAAA AGTAAAATAG CCTGTAGCCC
451 CAGCTACTCA GGAGGCTGAG GCAGGAGAAT AGCTTGAACC CAGGAGGCGG 
501 AGGTTGCAGT GAGTTGAGAT TGTGCCACTG CATTCCAGCC TGGGCGACAG 
551 AGCTAGACTG TCTCAAAGAA AAAAAAAAAA AATAACAAAT AAATAAAAAG 
601 TAAAATGAAA GCATGTAAGT GTAAGATGAC TAGTTCAAGC AACCTCTCTT 
651 CAAGTACAGA GTATTCAGAG TAGAGATTAA AAGAGGTTTT CAAGGACAGA 
701 GAAAATTTGA AGTTTGAAGG CAGTTCCAAA GGAAGGCAAT GATTCTTAAT 
751 AAGACTGGAA GTTGGAAGTA ATATAAAAAG ATAAATCAGT TTCAAGATGA 
801 TTTTACTAAG CAGGCAGCCC TTAATTTACA AATTCTAGAT TCATACATAT 
851 CTTAAACATA CAAAATGATA TGAGGAGAGG TAAGTTCAGG GTCTGAGTTC 
901 CTGGCTGTTG TTGGAACTGA TTTCTGTGTA GTGATTCAGA AGATGTGAGA 
951 CACCCTAATT TACAAGTACA GAGGTATCTT CTTTTCTGCA AACAGCAGTA 
1001 CAACAATAGT TCCTCTTACG CAGCTGTGAA TGAACAGGAT TATTACAATT 
1051 AATGATATCT CATTTGATTG GCGCCTTAGA GAATTAAGAC CTTTCACACC 
1101 TAATATACAA CTTTGTTGTG AAGGCAGATA TTTATATTCT CATTTTACTG 
1151 ATGAGAGACT ACCCGGAGAC GCTATGTCAC ACCTGAAGGA TTAGGTACTT 
1201 TCTCTGTTAA GTCCAATGTT CCTTCCGTTA TTCCATGCTA GGCAGTAATA 
1251 AGTTCTGTCT TGCCTGAGTA ATAAGCTCCA AACCTCGGAA CTGCACCCAT 
1301 CTTGAGAAGG AGGAGGGCGC TGTGGTTTTT TCTGATAAGT GCAGCTGGCA
1351 GACACTCTAT ACGCTTAATC ACGGGCAAAT CCTACCTAAG CTGCCTACCA 
1401 AACTAGTCCT TCTTTTCCCC GTTGCCCACG CAGATGGCTG TTGATCTTTT 
1451 CTGCAACAAA TCCAGGAGTT TCTCU I I I I GTTTTATAAT TGCTCCAATA 
1501 GATGCTTTAG GATTTAACTC TCTGCTTTTT AAAGCAGAAT CGCCATCCCA
1551 GGTGTGCAAC CACGAAAAAA TTAGACATCC GTGAGAGACA ATGCCCTCCA 
1601 TGGCCCAGTT TCCAGGCAGA GAGAAGCAGC TCTGGGCTGA CCGCCAAGGC 
1651 TCCGGCCCGA GAGGGTCTTT AAGTGGAGTA ACCAGTCTTC AAGACCCCGC 
1701 TCCCAAGCCA CCGACGCGCT GACGCTGCAG CCCTGGACCT GCTGGGGGCC 
1751 TCTTCCTCGG ACCCGCATGC TGACAGCGGG ACTGGCAACT GGGCAGAGGT 
1801 CGACCCCGGG TCCGCACAGC ACCTCCCGAG ACCCAGCTCC CAGCTCCCTC 
1851 ACTTCCGGCT CTCTGGAGGC GGGCCCGGCC AGTGCCGCCG AGGCCAGCGC 
1901 GGCGAGCrCC TCCCCAGCAG CGGCGGGACG GCCACACCCT GCGCGCCGCG 
1951 CGGGCTCGGG TGGGGTCTCC GCTCCTGCGC CCTGCGCGCC GCAGCCGCAC 
2001 CCCCGACGGC GCCCCAAACG CTGTTGCGCC GCGCGCCCCG CCCAGCCCGG 
2051 CCTCGCGCTG GTCCCGGTCT CGCCCCGCAG CCCTCGATCT CCCGTGACTT 
2101 CCTCGGCCAG GCCGCCTGCG CCTCTGGGAC CATGTTGCGC TGGCTGCGGG 
2151 ACTTCGTGCT GCCCACCGCG GCCTGCCAGG ACGCGGAGCA GCCGACGCGC 
2201 TACGAGACCC TCTTCCAGGC ACTGGACCGC AATGGGGACG GAGTGGTGGA 
2251 CATCGGCGAG CTGCAGGAGG GGCTCAGGAA CCTGGGCATC CCTCTGGGCC 
2301 AGGACGCCGA GGAGGTGGGT CGCCGCCGGG GCGCCGCCTG AGCGTAGGGA 
2351 GGGCTGCGGG CGCTGGGGAC ACTGCGAGGA CCGAGGAGGG CGGCGGCTTG 
2401 AGGCGTTGCC AGGAGAGGAA GGAGGAACTG TGGCGCCCAG CGCTCCGGTG 
2451 GCTTCAGAAA CTCGGGCGTG GGGCCGCGAC CGGCGACCCC GGTAACAGAA 
2501 GTGGGTCATA ATACGAAAGT CTACTGGTAT TTGTCCAGAT AAAATGAGTG 
2551 TTGTGGACAC TCTGGCCCAC GGGCACTGTT AAATTTTTAA GACACTTTTG 
2601 TCCTGAATCC ATCCCAGGTT CTTTGTTTTC TGTTTTAATA CCTTGCAGAC 
2651 ATGTAATCCG TTTTAGCTGT CAGACTTCAG TGGGTCCCAA GTTTTGTATA 
2701 AAGGCGCACA CATTCGATCT CTTTCGAAGC TGCTTTGTTA CAGCAGCTAT 
2751 GTGTATTGTC TACTGTTTGA AAACTGTTTG AAAACCAATC GCGTGTTTCC 
2801 CCCACTTCCT GTTGAGAAGG AATGGCGGCA TTCCATTGTT TAAGACATTC 
2851 CTAGGTTAAT GCCCTAGGTA CATAAATTGA TCTGAAGGGT TGACTTGACC 




FIGURE 3A

FEB 19 2003
Docket No.: CL001103

Serial No.: 09/777,921 
Inventors: Gennady MERKULOV et al. 
Title: ISOLATED HUMAN TRANSPORTER . 



2901 TGCGACTGAG CAATTTCATT TTCTCTGAGT CATCTTAACT GTGCCCCTGA 
2951 ACTTCTGCCC CTTTAGrAGG CTGGAGATAT GTGGAACTTC TCCAACCCTG 
3001 TTGAAGCGTT CCCTGACACT GGCATTCTCT TATCCAAAGA GGGAAAGTGA 
3051 TTAGGTTACT ATGAGGGCCA ACAACTGTTA TATAGTTATA TTTCACTTCT
3101 CmTAATGT CTTTGGTAGT TATAGGCCTC TTCAGTTTAC TGTTTCTTCT 
3151 AGAGTCAGAT TTAGTAAGTT ACAATTTTTT TTGAAACTGC CTGTTCTGTC
3201 CAAGGTTCAT AATACTCACC GATGATTTTA TAACACTTCT GACTGAATCT 
3251 GTAGGTAGGT TCTCTATTTC ATTCCTCATA TCTATCCTTT TCTCCCCTTC 
3301 AATCTTGCCA AAGTTTTGTG TATTTTATTC ATACTTTGAA GGAACCAACT 
3351 TTTGGTACTT TGTGCTGATT GTCCCAGAAA TGGCCCAGTT GGAGTTCCCC 
3401 ACCATGTCCA ATCATTGGCT GGAAGCAGCC CAGGAAAGGG ACGACCTTGC 
3451 TGCAGTGCAT CAGCAGATGC CAGGGTTAGA GGCTAGAGAG TGGAAGTCAA 
3501 CTGTGTTCCT CACAGTAGGT GCCTTTGAAG GGAGATCTCA GTGGTACAAC 
3551 TCCATGGTCC CTACAATATA CAAAAGCTCT TTGGAGTGCT CAATGATTTT 
3601 TAAGATTGTA AAGGGATCCT GAGATCAAAA AGCTTGAGAA TTGCTGCTGT 
3651 ATCACCATTT TTACGTAACT GCATCATATT CTGTTATATG TTTGTGTCAT 
3701 AGTATATGTT ACCAATTCTT TTTAAATCAC CTTTTACTTT ATTGATAGTT 
3751 TAAAAACGAT TGTAAGTGAA ATTGCAATGG ATGTCCTTTG TATTCATTTT 
3801 CTCATTCTGG TCCAGTTACT TTCGTAGGAT AAATTTTGAG GAGTGGACAT 
3851 TGCTGAGTCT GAAGGTAACA CACATTTTAA ACTGGGATAC GTATTGCCTT 
3901 TCGGAAACCT TAGACCCATT TTCACTCTTT TGACTGACAG TGCTTGCTTC 
3951 TCCACATCCT CGCTCATTCA GGGTATCAGT CTTTGTAAAG TCTCCTATTC 
4001 TGCAGGTGAA ATTCCTTTTC ATTTCCTGTC TTAGTCCATT TAGTGTTGCT 
4051 ATAGTGGAAT ATCTGAGACA GGGTAATTTA TAAAGAAAAG ACATTTATTT 
4101 AGCTCACAGT TCCGCAGGCT GGGAAGTTTA AGAAGCGTGG TGCTGGCATC 
4151 TGCTGGACTC CTGGGGAGGG CTTTCCTGCT GTGTCACAAC ATGGTGGAAA 
4201 GTCAAAGTGG AAGTGGACAT GTGTGAAGAA GCAAAATCCG AGGGGTGTCC 
4251 TGGCTTTATA GCAACCCAGC CTCGAGGGAA CTGATCCATT ACTGAGGGAA 
4301 CTAATTCAGT CTCATGAGAG AGAGAACTCA CTCACTACTG CAAGAATCAC 
4351 ACCAAGCCAT TCATGAGGGA TCTGCCTCCG TAACCCTGAC ACCTCCTGCT 
4401 AGGTCCCTCC TCCCAACAGG GCCACATCAG GGATCAGACT TCAACATGAG
4451 TTTTTGTGGG GACAAACAAA ACGTAGCACT TGCTTTGCCT TTTGGTTCTA
4501 TTCACATCCT CCACAGGATT GCATTATGCC TACCCATTTG GTGAGGGCAG 
4551 TCTTCTTTAA TTGGTTTACT GATTCAAATG CTACCCTCCT CCAGAGACAT 
4601 CCTCACAGAC ACACCCAGAA ATCATGTTTT ACCAGTTATC TGGGCATCCC 
4651 TTAGTCCAGA CGAGTTGATA CATAAAATTA ACCATCACAC ATGGGATAGA 
4701 ATTAGGATTA CACAGTCAAC CTTTATGGGA GAAAATTTCA GAGGCATGTC 
4751 AGGGGTTTAT GTAATGTCAA GGAGTGAGGA CATTGGCTAC TTGAGCATAG 
4801 AAATGAGAAC TGTGGGGTGA CTCTTCGGTG GAAAGTTTCA AGGTAGTAGT 
4851 TTGTATCTAA GCCAAATACT CAGCTTGAAG CAAAATCTCT ATAAATTTTC 
4901 ATCTGATTTG ATCTCATCTC CGTGTTTCCA AGCATTTGTA ATGAATTGAG 
4951 CATTTAGAAG AGAACAAATT TCTGTTTAAG TTTCTTTAGA TTTTAGATGG 
5001 AAAGAATGTA GAAATAAGAG TAGAATGTAG AAATAGGTAT AAAGAATATA 
5051 ATAGCTAACC ATTACTAAGT GTTCCAGAAT TATCCAGGGA AGAGAAAAGA 
5101 ATTCAAGGCA AGTCCTGAGA CAAAATTAAG AACCAATTGG AAGTGAAAGC 
5151 GCTACATTTT TTTTTTCTGG TATGACCTTT CTTTTCTATA TGTTCCAAAT
5201 CTCCTCACTA TGAAATTAGT GAAAAATTAA AGTTAAAAAT TAGAGAAAAT 
5251 TCACATTAAG TTCTCCTAGG ACTCAGTAGT ATAAGGGTAT AGACTGAGAG 
5301 TAGAATGTAG TGTGAGAACA AGGAGATACA GTATTTAACC ATTACTAATT 
5351 CTCTTATACT TGTCTAGTAA TCCTATTTCC TTTTAAAAGT CTTCAGTTAT 
5401 TTTCTCTTTA CGCACCTCCT TCTCCCTCTT GTCTTCCTCC TTCTACCCCC 
5451 ATCTTTCTTC CTGTGGAGCC TTCATGAATG GGATTAGTGC TTGTATAAAA 
5501 GTGACCTGGA AGACCTTCCT TGCCCCTTCC ACCATGTGAG GACACAGTGA 
5551 GAAAACAGTG GTCCATGGAA CCGGAAAGTG GGTCCTCACT AGACAGTAAA 
5601 TCTCCTAGCA CTTCGATCTA GGACTTCCAG TGTCTGGAAC TGCAAGAAAT 
5651 CAATGCTTAT TGTTTAAGTA AGCCAGTAGT ATTTTTGTCA TAGCAGCCCA 
5701 GTTGGACTAG GACAATTACC AAGAGCAAGA AGGGAAGCAG CAAGCTACAA 
5751 GAGAGTTCCG TCCTTGGTGT AAATTGACCG TGTAATCCTT GTCAAGTTTG 
5801 AGCCTTACTG GAGCTTTACT TTCTTATTCT TAAAATGCAG ATATCTTGCC
5851 TGCATCCTGG ACAGAGCTTT TAACAAGGTC ATATGTTGCA GAATATGAAA 
5901 GTTCATGTTA AAAAACCCTT TAAAATGTGG TATCCCATTT ACTAGCTGGT 
5951 GAACTTCTTG AGGAACCTCT GTGCCCATGG GTATGAAGTG TATGCTGAAT 
6001 GATCACCCAA TGTTAGAGGA GTGGGTGGAC TGGTAACCTG ATTTAAGGGC 
6051 CATTCTAACT CTTACATTCT ATGATTTTTT TAATTCTGTC TTTAAGTTTT
6101 TACATTTACA ATCACAGAAA AAATAGTCAC ATAGAAGAAT AGTAGCTTAG 
6151 CAAATGTTTA TTGCATTGAG TGGAATCAGG ATTTCACTCG ATTAAGTAAT 
6201 TCCTCTGTTA ACAAAGAGGG TTCATTTCAT TTTTATTTCA TTAATATTGC
6251 TTTTTTTTTT TTTTTTCTGG AGACAGAATC TTGCTCTATC ACCAAGGCTG
6301 GAGTGCAGTG GTGCGATCTC GGCTCACTGC AGCCTCTGCT TCCTGGATTC 
6351 AAGCGATTCT TGTGCCTCAG CCTCCCAAGC AGCTGAGATT ACAGGCACAT 
6401 GCCACCACAC GTGGTTAACT TTTGTATTTT CTAGTAGAGA TGGGATTTTG 
6451 CCATGTTGGT CAGGCTGGTC TTGAATTCCT GGCCTCTAGT GATCTGCCTG 
6501 CCTCTGCCTC TGAAAGTGCT AAGATTACAG GCATGAGCTA CCATGGCCAG 
6551 CCCATTTCCT TAATATTTTA ATTGTCAGAC ATGTTATGGT TTCTGGCACA 
6601 ATATTAAGAA GACATGATAT GAAATCACAG GGTGAATTTT AGGGCATCAC 
6651 AACAGAAAGA TTATGGTATA AGAAAAACAA TGGAATTCCA ACTACATTTC 
6701 TGTCAAATGT TCTAAAATAT ATAAAATCTG TATCTTTTGT GTTCTCTCCT 
6751 GATTTATATT CTAAATTTGA TGTTATCCTT CTCTGCAGAA ATAAAGTGTC 
6801 TGAAAGAATG AAAAAAATGG AAGAATTCTT TAGTAAGGTA TAAAATACCC 
6851 TTTCTATCTT TGTAGCATTC TAAGCCTTTT GTCACCTTTC CAAACTCCCA 
6901 ACATGCCATA TTCCCTGACT AGGCCACAGC CATGTACATT GATCCCTTTA 
6951 TTTTCTTCTC TCTGCCTGAG ATTTCTCTCA TTCCCCCTTC TCTGCCTGGT 
7001 ATATGATTGC CCATTGTTTA AGGCCCCAAC TCACCTTTAT AATCTTCCTA 
7051 GCCCACTTTC TTTATCGGTA TTCCAGAAAA AACAAAAGAA GCTTCCACAA 
7101 GACAACATTC TGTAATACAC TGCTTAACTT CTTTTGACCC TGCTGAGTTC
7151 AAAAATCTTA TCTTTTTAAG GATTGAATGG AGTCCACCAA GGTATCTATA 
7201 TTTGACAGGA TTTATGAAAA CAAAAGGATT TGTTGAGAAA GTTTGAAGCC 
7251 TAACTCTGAA ACGTGGATCA TAGTGTTTAC TACACATTAA CTGTTTTAGT 
7301 GGATGTAATA GTTATTATTA TAGGCTGTGG AATCAGAACA GGGTTCAAAT
7351 GTTTTCACCG CTTGCTAGAC TGTGGCCTTG GGCATGTTAT TTAATGCCTG 
7401 GAGGCCTCAA ATGTTAACTA GGAATGGTAA GACCTACCCA GTAACTTAGC 
7451 ATAAATAGTA AATTCATTCA TTTAATGTTT TCAAACAGTG CCAGACATTG 
7501 TTTAATGAAC TGGGGATATA GTGGTGAACA ACACTGACAG CGTTCTTCAT
7551 TGTATTCTCA AAACCCTCCC TATAGTAAGT AGGTCTGTGT GTGTGTGTAG 
7601 GTGCATGGGG AATAAAAAAT AATAAGCAAA TAATGAACAG GGTAATTTCA 
7651 AAAAGCAGAA AGAGCTATTC AACAAAACTA CCTGCCTTTT ATTAGATGAA 
7701 ACTCTCAACT CTATGGTTTG TTCTCTCCTG TCAATTCTGT TAAATGCTGT 
7751 CAGCCTGTTT TCCTTATCAC CCTGGCCACG ACTTCTGTCT TTTCTGCTTG 
7801 GTCCTGTAGA CTCTAACCCA AGGCTCATTC TCTGCCTGGC TATCTGCCTT 
7851 CTGTGGCTCT TTGCCACTAC CTACATTTTC TGTGTTGCAC AGGGAAGGAC 
7901 CATTCCCTGT GGACCATAAA ATTCTCTTTT TGAAAGAATT CATTCTTGAT 
7951 TGGGCCACAG CACATCTTGT GAAACAGCAT TAGACATTTG CCACTGCTCA 
8001 GCAGCTCTGG GGGAAAATGT TTACTGAGAA GCGTACAGTA GTTTTTTTGA 
8051 CTAACCATGG TGCAACCTCC TCCCAGAGGG AAACCTATGA GTATTTCAAG 
8101 GACATGTGAT GGTCTGTTTT TGTCCCCAGT ATCTGACATG ATGGGTAGTG 
8151 TAGAGCAAGA GCTTACAGAT AATGGCTAAA TTAAATTTTC TTTTTGAATT 
8201 TTAATATTCA ACTTTTTAGG GTACCCAATC TCCATATTTA GGAAAATAAA 
8251 TTACATAAAA AGTGGAGAGT TTTTATTGTG AAACTGCACC TCCATATTCC 
8301 CAGTGGTGCA GGATGAGGGA GCACAGGTGT TGGTCTGGGG AAGCCAGGGC 
8351 CCTCTGTGGT TCTGGAGGGT GAGGATTAAG AGGAAGCCTT AGATAGTATT 
8401 TATGAGTATC TGCTGACTTC TCTCTGGGAC CCAAGATCAC TGAACTTTTG 
8451 CCTATTTTGA GATCATCTTT CCAATCCAGC CACTAACAGC TGAAGGATAG 
8501 GCTTGCCCTG GAGCCATTGT AGTGGTTGGA TGAAGATAAA AGATAAAAAA 
8551 CTGTGAGGGG AGGTGTCACA GAAGAAAGGG CCCATGTGGG CAGATTTTCA 
8601 TTCAATTCCT AGTCTTTATT ACAGCAATTC TCCAGTGCTG CAACCTTAGA 
8651 AAAGGATTCC TACAACACAA TGTAGGTACC CATCAGCAGC AGATTGGATA 
8701 AAGAAAATGT GGTACATACA CACCATGGAA TACTATGCAG CCATAAAAAA
8751 GGAGCAAAAT CATGTCCTTT GCAGCAATAT GAATGCAGCT GGAAGCCAAT 
8801 AACTTAAACG AATTATTGTA GAAACAGAAA AACAAATACT GTGTTCTCAT 
8851 TTACAGGGGG AGCTAAACCT TGGGTAAATG GGGCATAAAG ATGGGAACAA 
8901 TAGACACTAG GGACTCCAAA AGGGGGGAGG GAGGGAGGAG GGCAAGGGCT 
8951 GGAAAGCTTC CTACTGGGTA CTTTGTTCAC AACCTGGGTG ATGGCACGAT 
9001 TAGGAGCTCA AACCCCAGTA TCACACAGTA TACCCTTGTA ACAAGCTGAT 
9051 GGTGTAACCC CTGAATCTAC AATAAAATTA TTTTATTTTA AAAAATCATT 
9101 ATAAGGATTT TTAAAAAGAA GGATTCCTAG ACAGGTGCAG CCAAACAATT 
9151 TTTTTTAAAT GTTGGCAGGC CGCCACCGCC AGTCACTTAT GCTGCAATAG
9201 CCCATGTCCC AACATTCCCA ACCTACTTCT CTCCAAAAGA GAAGCTATAC 
9251 TTTCAGATGG CCCTGTGCTG GGTTCTCCCT GGAAGTTTCT GGGGAAAGGG 
9301 GCTTGAGTTG CCCCGACTGG ACTCTTCCTG GAGTGGGAGC CGGGGCTTCT 
9351 GATCAGACGT GAGTGAGGCA GGAACTCCGC GGTCTCCCAG CGCAGCCCAG 
9401 AGTGCGGTCC CACGCAGGTC CCGGGTCCTG CGCGCTCGCG CCTTTGCGCT 
9451 GAAGCCGTTA GGATGAGCCC TCTCCTTCCA GAGCTTTAAC CGATGAAGGT 
9501 GCATTGTGTT TGGCGCCCCT GAGGAGGATG CTGTCTTAGG CCTCTTCCCA 
9551 CTGGACGTGT GTGGTGGGCA GAGATCCCGT TCGTCGGTCG CACTTCCACC 
9601 CCGCTGGGGC TCACTCAGGC CGCGGAGCTG CGAGGGAGAC ATCCTCGATG 
9651 GACTCCCTCT ACGGAGATCT CTTTTGGTAC CTGGACTATA ACAAGGATGG
9701 GACCTTGGAC ATTTTTGAGC TTCAGGAAGG CCTGGAGGAT GTAGGGGCCA 
9751 TTCAATCTCT AGAGGAAGCG AAGGTGGGTC TCACTGGGGC TGTAATCAGA 
9801 GAGACGTTGG GGCTGGGAGC CCTGGAGAGG CATTGGGCAG AGAGGGCAAA 
9851 ATTTACATGT TGTCAAGCTT GACCTGGGCC CACTGCAGTG TTCAGGTGGT 
9901 TGACCAGCGT TACCGTTTAT TAAGAATAAC AACACAGCTA ACACATTTCT 
9951 CAAGTATTTT TCTCCGTTTT CTCCTTGGCT GTAGTAAAAT CTCCAACTTC 
10001 AGATTGCTCT CAAGATGTTG GCTACATACA GCCTTGTCTT AGGAGTCACC 
10051 TTGTTCAATG TGCTCACCTG TCATTAGTCA CCCAGAGGGG CGTCTAGGCT 
10101 AAAGATGCGC CCTCCCCAGT TCAGAGAACT GGAATAATCA CTCTACGTGT 
10151 ATTTGGGAGT GGGGTGGTGA TTGGAAATTT TCTGATGTTA TGTTTTGGTT 
10201 TCTGTTCCTG GAAGGGGGCA GTGGAAGTGG CTTTTACTCT CGGGTTTCAC 
10251 TAGTGCTGAG GTTTCCTCAT AATATGCCTT AATTGATAGA CCCTAGTTAT 
10301 CAGTACCGAG CTTAGGCTAA CCCTTCTCTT CCCCAGAAGG CTAACCTACA 
10351 GGCTCCTTCT CAGCATGTTG TGCTTCGTAC ATACTCCTAT TGCAGTATTT 
10401 CCAAGTCATT TTTCATTTGG AATTTATTAT TGTATATAAT AATTACTTTA 
10451 TAAGTATATT TGCTCTTTGG ATGTTTGACC CGGTAGACTG GGAGATCATG 
10501 AGCATGTGGA CTATTGAGTT TATTTTGGAT AATTGGTACT TCGTGCCCAA 
10551 AAAACTGTCA GTTGAGTTCT GTCATGTTGA AATTTAGTAA AACTCTTTCT 
10601 ATTAGCCATG TGAACTTTGG GAATATTGAA GCATCCATTC AGTCATGGGT 
10651 CAGTTCTAGT TTGAGCACAT TCTATATTCC AAGCCCCATA CCCTGGTATC 
10701 CTCATCTGTT ATATCAGAGG CCTGGACTGT GTACTTTCTG TGGACCAATT 
10751 CAGTCCAAAA TGTTATTTCT GCAAAGCTTA TCTGGATTTT TAATTCCTAG 
10801 AAAAAAGCAG TGTTTCTCCT TTTAAAGTTA AGTGTTCTTG TTCAGGTGCA 
10851 GTGGCTCATG CCTGTAATTC CAGCACTTTG GGAGGCCAAG GCAGGTGGAT 
10901 CACTTGGGGT CAGGAGTTCA AGACCAGCCT GGCCAATATG GTAAAACCCC 
10951 ATCTCTACTA AAAATGCAAA AATTAACCGG GTGTGGTGGT GGGTGTGTGT 
11001 AGTCCCAGGA GGCTGAGGCA GGAGAATCAC TTGAGCCTGG GAGGCAGAGG 
11051 TTGCAGCAAG CTGAGATTGC ATCACTGCAC TCCAACCTGG GTGACAGAGT 
11101 GAGACTCCAT CTCAAAAAGA AAAAAAAAAA GTTAAGTGTT CTTCATATTT 
11151 GTTTAAAGAC ACTCTTATAT TTAGATTTGC AAGTGTAAGT TGTATTTGTT 
11201 TATTTGATAC AAACTAGCCT TTCATAAGAA ATTCTGGGTT AGCTATCAAG 
11251 TCGAATCTTT TGAAACACAT TTCTTCCTTA TTGAAACAAA AGGTTTGTAG 
11301 AGCTGTCTTG CATTTTTGGC AAGGACGCTT TGTGTACCTA GTGGTGACTG
11351 AGGAGGGTTC ACATGTCAAA ACCCAAGGGA GGGGTGTCCC CAGAGAATTC 
11401 TGCACCAACC ACACAGAACA TTCTGTTTCA GAGGAGCACC ATTGTGACTT 
11451 TTCCTCAAGT GGCAGTCACA TCGTTAGGAG GTTTTGATGT GAGGTCTCTT 
11501 CCCACACGTC TCCACCTCCC CAGTAGGAAA ATTTGTTTAT ATAGACAAAA 
11551 CTCAACTGAT TAAAAAAAAA AAAAAGAAAT GATACTTACA TTGTCGTGTT 
11601 AAGATACAAA AGCAATAACT TTTTATTGTG AAAATAGTCT GTTTTTGAAC

11651 AATATATTGT TTTGTTTTTT CCTGTGAAAG TTGAGAAACT AAATATACGA
11701 AGAGATAATG GTCAGACCAT AAATAAAAAT AGAACTTTGA CTCAAAATTT
11751 ACAGCAGTCT GCCCAGAAAA CCAGCCCTTT ATCTAAAATA AACAGACCAG
11801 GAAACCAGCC TGTTATGTCA GACTTATAGG AAGTCAGGTT GCTATCTCTA
11851 GAGACAATAC ACAAAGCTAT GCAATAACTG CTGTAACAGC CCCAAATGGT
11901 CAGAATTTGA TTAATAACCG ACAGCCCCCC TAATTTTTTT CTTCACTNNN



11951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNTTC 
12001 ACCGCTTGCT AGAACTGTGG CCTTGGGTCA TGTTATTTAA TGCCTGGAGG 
12051 CCTCAAATGT TAACTAGGTA ATGGTAAGAC CTACCCAGTA ACTTAGCATA
12101 AATAGTAAAT TCATTCATTT AATGTTTTCA AACAGTGCCA GACATTGTTT 
12151 AATGAACTGG GGATATAGTG GTGAACAACA CTGACAGCGT TCTTCATTGT 
12201 ATTCTCAAAA CCCTCCCTAT AGTAAGTAGG TCTGTGTGTG TGTGTAGGTG 
12251 CATGGGGAAT AAAAAATAAT AAGCAAATAA TGAACAATAA MTTATTTTA 
12301 TTTAAAAAAA AAGAAATGAT ACTTACATTG TCGTGTTAAG ATACAAAAGC 
12351 AATAACTTTT TATTGTGAAA ATAGTCTGTT TTTGAACAAT ATATTGTTTT
12401 GTTTTTTCCT GTGAAAGTTG AGAAACTAAA TATACGAAGA GATAATGGTC 
12451 AGACCATAAA TAAAAATAGA ACTTTGACTC AAAATTTACA GCAGTCTGCC 
12501 CAGAAAACCA GCCCTTTATC TAAAATAAAC AGACCAGGAA ACCAGCCTGT 
12551 TATGTCAGAC TTATAGGAAG TCAGGTTGCT ATCTCTAGAG ACAATACACA 
12601 AAGCTATGCA ATAACTGCTG TAACAGCCCC AAATGGTCAG AATTTGATTA 
12651 ATAACCGACA GCCCCCCTAA TTTTTTTCTT CACTTCCAAC TTAGGACGAA
12701 CCAGAGAAAG CTAAATATGC AGCACCTACT AATCAAATAG GGTGCCGCGT 
12751 TTCTAATGAA CCCTCCTACA GCTTCCCCAG GCCAGCAGCC CCCAATCAGG 
12801 AAACGCCTGA AGCCTTCCCT TTTTCTCACT GTAAAGCTTT CCCACTCCTC 
12851 TGCCTGGCTT TGAGTCTCTG TCAATACACA AGTGAGGGTG TCTGACTCCC 
12901 TTGCTATAGC AAACTCGGGC CAAGTAGATT TTACTTTTCT CATTTGATTG
12951 GTCTTTTATT TCTAGAAGGA ACATACAAGA AAATTTAAAG GGGAATCCAT 
13001 TCCTAATCTT TCATATTATA GTAGTCCCCT TTTATCTGCA GGGCATATTT 
13051 TCCAAGACCC CCACTGAATA CCTGAAACTG TGGGTAATAT TGAACCCTAT 
13101 ATATACTCTC TCTATATATA CATATATATA TATATTTTTT AATTTTTTTT
13151 TACTTTATCT TTAATTAGCT TTAGCTCTTT TTTTTTTTTT TGAGATGGAG
13201 TCTCACTCTG TCACCCAGGC TGAGTGCAGG GGTGCAGTCT TGGTTCACTG 
13251 CAACCTCTGT CTACCGGGTT CAAGCAATTT CTTGTGCCTC AACCTCCGGA 
13301 GTAGCTGGGA CTACAGGCGT GTGCCACCAC TTCCTGGCTA ATTGTTTTAA 
13351 ATTTTAGTAG AAACGGGATT TCACCAAGTT GGCCAGACTG GTCTCGTACT 
13401 TCTGACCTCA AGTGATCCGC CCACCTTGGC CTCCCAAACT GCTGGGATTA 
13451 CAGGCGTGAG CCACCATGCG CCCAGCCATA GACTATATAT TTTTGATCTG 
13501 ATAACTGGTT CAGCTACTAA GTGACTAACA GGCAAGTAGC ATCTATAGTG 
13551 TGGATATGCT GGACAAAAGG ACATTCACCT CCTGGGCAGG ATGGCACAGA
13601 ATGTTGAGAG ATTTTATCAT GCTACTCAGA ATGGTGTGCA ATTTAAAACT 
13651 TATGAGTTGT TTGTTTCTGG AGTTTTCCAT TTAATAGTTC AGACCATGGA 
13701 TTGACCGCAG GTAACTGAAA CTGTGGAGAG TGAAACTGTG GATAAGGGAG 
13751 GACTATTGTA TTGTTAAGTC AGACTCATTA GGCAATCATA ACTCTTGATT 
13801 TGCCATCAGA AATGCTGCAG AAATATGGGT TAAAAAAAAC TGTTCAAAAA 
13851 TAGGGTCAGG GATGTCCTTT AACTTGTTAC TTCCAAAATG TTAGTGAAAA 
13901 CTGTGGCCCC AAAGAGTGAA AGGAACAAAT GACTAAGAGA AAATCTTGTT 
13951 TTCAGGATGA CAGATTAAAA AAGAAGCAAC TTGCTGAAAC ACTGAAAATC 
14001 TCTCCACTTG TAAGATAACA CAAAACTGGC TAAAACTGGT TGGAATGAAT 
14051 ATGGCCAACT CAAGTCTGCA CAGAACTAAC TTGGTGATGT TACAGCCCAA 
14101 ATTTCCACCA CATATTTTAT ACTAACTCCC CCCGGATTTT CACACATGAT 
14151 CTGTGAGGTA GCATGAAGAG GTAACTATGC ATGCCTAAGG ACTTGGGAGA 
14201 CCTCCCCATT TCCTTCCACC AATCACCCAC TAATCCCAGA ATCCGCCCCC
14251 AAACCTTTTC TAATAACTAC CTTAAAGCCA GCATAGGGAG ACAGATTTGA 
14301 GCTGGACTCC TGTCTTCTTG TGGGTCACCT TGCAATAAAA AGCTTTTCTT 
14351 TTCTCAACAC CTGGTATTAT AGTATTGACT TCTAGTTCAT CGGGCAGCAA 
14401 GCCCCTTTTG GTCGGTGACT ATTCTTGTTC GCTGATATTT CCATTGGCCA 
14451 AAATATAAAC CTCTTAGATG AAACTTCAGT ACGTAAATGG CGCCACAGAA 
14501 TGCTGTGACA TTTTTCTCTT GGATTATAGC AQGTTACTTT ACTGAATACC

14551 GTAGGCAGTT ATAACACACT AAGTATTTGT GTATCTAAAC ATAGAAAAGA
14601 TACAGTAAAA ATATGGTAAT TTTTTTCAAC TTTTAGTTGA GATTTGGAGG
14651 GTATGTGCAC ATTTGTTACA AGGGTATATT GCATGATGCT GAGGTTTGGG
14701 GTACAATTGA ACCCTGTCAC CCAGGTAGTG AGCATAGTAC CCAATCGATA
14751 ATTTTTCAAC CCTTGTCCAT TCCCTCCCCG TTCTTGTAGT CCCCAGTTTC
14801 TGCTTTTCCC ATCTTTATAT CCGTGTGCAC CCCATGTTTT GCTCCCATGT
14851 GTATGTGAGA ACTTGTGGTG TTTGGTTTTC TATTTCTGCG TTGATTCGCT
14901 TAGGATAATG GCCTTCAGCT GCATCCATGT TGCTGCAGAG GACGTGATTT



14951 TATTCTTCTT TATGGCTGTG TAGTATTCCA TGGTGAAAAA TATAGTACTA 
15001 TAACCTTACT AAATCACTGT CATATATATG GTCTATCATT GACTGAAATG 
15051 TATACAGTGC ATGATATATA TATATATATA TCTATAATGT CTTATCCATT 
15101 TCGTGTATTA TGAGATTTGA TTGCTAATAT TTTATACAGG AGTTTTGCAT 
15151 CTTTTTCACT AGTTGACATT GCTTGTAATT TTCCTTTTTT TGTGATGTCC
15201 CTGTTAGGTT TTAGAATCAA GTGTATACCC GCCTCATAAA ATGGGTTGGA 
15251 AAATGTTCCC ACCCTTTCTG TTCTCTGGAA AATTGGTGTT TTTTTCTTAA 
15301 AGTTTGGTAG ACATTATTGT TAAAACCATG GGGTCCTCGA TTTTTCTTCA 
15351 TGGAAATGTT TTCAAATTAC ACTTTAAATT TCTTTAAAAT CTGAGTATAG 
15401 GGCTATCAGA CTTTCTGCTG TCTTATGTCA GTTTTTAATA AGTTGTTTTT
15451 GTAGGCGTTT GTTATCTCAC TTTCATATTT TTGATATAAA GCTTTTCATA 
15501 ATATCATTAA TGTCTATAGT GTCTAGTAGT TTCCATCTTT ACTTTCTCAC 
15551 ATTGGTTATT TGCCAGTTTT AGGAGTTTAT CAATTTTATT AGTCTTTTCA 
15601 AAGAACCATC TTTTGGCTTT GTTAATCCTC CCAATGGTGT GTTTTCTTTC 
15651 TCATTACTTT TTGCTCTTTA TTTCCTTCAA CTTCTTTTTT GCTTAATTTT
15701 AAAATAATTT CTTGAGATTG AGATAAGCCT CAATGATGGG TCACCGATTT 
15751 CCAGTCTTTC TTCTTTTCTA ATTATGCATT TTAAACCAGA AATCTTTCTC 
15801 TAAGTGTAGC TTTAGTTGCA GCTCACAAGT TTCAGATCTG TCTCTCAGTC 
15851 TGGAGGTTGG AGATCTGACC ATGACCATGA AACCATCCAG TCACAATGTG 
15901 GCATTATTTT TTTAAI I I I I I I I I I I I I I I TTGAGATAGA GTTTCACTCT 
15951 TATTGCCTAG GCTGGTGTGC AATGGTGCGA TCTCGGCTCA CAGCAACCTC 
16001 CACCTCCCAG GTTCAAGCGA TTLI I I IGCC TCAGCCTCCC AAGTAGCTGG 
16051 GATTACAGGC ATGCGCCACC ATGCCCAACT AATTTTGTAT TTTTAGTAGA 
16101 GATGGGGGTT CTCCATGTTG GTCAGGTTGG TCTTGAACTC CCGACCTCAG 
16151 GTGATCCGCC CACCTCAGCC TCCCAAAGTG CTGGGATTAT AGGAATGAGC 
16201 CACTGTGCCC GGCCCAACTT GGCATTATTT ACCCAGAAGA GCATGACCAT 
16251 GAGAACAGTA GAATTTGTAA GCTTTGAGTG GGTGACTATG AGTGTCATAA 
16301 TAGGTAGATA GGTTATATTT TGGGTGGTGG TAGGAGAGGG CTTACAGTTT 
16351 GCTATGACAG LI I I I IATAT GGATCATCCT TAGTAAAAGA TTATTTAATT 
16401 TTTGAAATCA AAGGGGAAAA CACTAGTTTA GGCTTTCTTC I I ILI I ILI I
16451 TTTTAGAGAC AGGGTCTTGC TCTGTCACCA GGTTAGAATG CAGTGGTGCA 
16501 ATATTGCTCA CTGTAACCTC AAATTCCTGG GCTCAAGTGA TCCTCCTACC 
16551 TCAGCCTCCA AGTAGCTAGT ATTTACAGGC ATGCACCAAC ACATCTGGCT 
16601 AATTTTAAAA Al I I I I IATG GAGATGAGGT CTCACTATGT TGTCCAGTCT 
16651 GGTCTTGAAT CCTGACCTCA AGTGATCCTC CCCCATCAGC CTCCCAAAGT 
16701 GCTGCAATAT TTTAAATCCT GTGGTAGGTC AAGTGGTTGT CTTCTATCTT 
16751 GGGGTTTATA AAGTACATGT CAAGAAATTT AGGGTATGGT TAGATTAGCT 
16801 TTAAAAATGT CATGTTTTAT AAAAATCAAT GCATCATTTT TCTGATTGAA 
16851 AATTTAACAC AAGACTCAGA ATLI I I I IGC AGTAGTGGAA TTACTTTTAT 
16901 TATAGATCTT TGCGATAATG AATGATGATA CATCTGGCCA AAAATAGGTA 
16951 CTATAGTCTT TTAGGAAAAC AGCTAATCTG CTTGAAATAT GTGTAGAAAT 
17001 AATTTAGTGC ATCAGCCCAT ATTGGCAATA ACTTCTCTCT AATTTTTTTT
17051 TATAGAAAAT TTTTACTACT GGAGATGTCA ACAAAGATGG GAAGCTGGAT 
17101 TTTGAAGAAT TTATGAAGTA CCTTAAAGAC CATGAGAAGA AAATGAAATT 
17151 GGCATTTAAG AGTTTAGACA AAAATAATGA TGGTGTGTCT TTCTTTTGTA 
17201 TTTATCACCA GCTATGAAGA AGCATTTATC ATGCTTTCAA GAGTCTAAAA 
17251 GGATGCTTAT TTAATCTCTC TGGTTTTAGA TGATAATTAT TATTTGTGTT 
17301 AATALI I I I I TTTAGTAATG TGAI I I I IAT GTAGAGTTTA TATTATTTAG 
17351 TGAAGAAAAC TTATAGATAG LI I I ILI I I I TCATTACTTT GAAATGTAAT 
17401 GAATTACATT TCTGAATTAA AMCTGTGGG CAGGGCCTGT TGTAAATGTT
17451 AACTATQGAA CATTATGCTG ATTTGAGTTA AACCTGTAGG TTAAAAATAA 
17501 TAATTATATT TTCTTGTCCT CTGGGTAAAA TGAGATTTCT TTTTATTTGT 
17551 ATAGAAGAAT GACAGTTGTG TCATCTAAAA TTTAAAAAAC TTTCAGATTA 
17601 TCTTGCATCT GTTAGTTTTT TTGGAAGAAT TAATTTAGAG AAGATATCTC 
17651 TGATCCTGGA AATTAGGGAA AAATAGCATA TAAACGTTTA AGTGTGTACC 
17701 TTCTGGTTAA GATTATGACT TCTATATTTC GATTAATAGG TTGGAGTTTG 
17751 TCTTAATCTG TTTTCTGTTG CTGTAATGGA GTACCACAGA CTGGGTAATT 
17801 TATGAAGAAA TGAAATTTAT TTCTTATAGT TCTGGAGGCT GGGAAGTTCA 
17851 AAGTTGAGCC GAATCTGGTG AGGGCCTCTT ACTATGTCAT AACATGCTAG 
17901 CAGGCATCAC AGAGCAAATG CACTACCTCA GATCTCTCTT CCTCTTCTTA 
17951 AAAAGCCACT AGTCCCATCA TGGGGGCCCT ACTCTGAAGA CCTTATCTAA 
18001 TTCTAATTGG AAATAGGGTC TTGAAGCCCT CATCACTAGA GGTAACCTTT 
18051 AACAGGAAGA GAGAATTTAT AAAAATTATA ATGCAGCACC AAATCCCTCC 
18101 CTACTTGTGA ATAGTCAAGG TCATTTCATT TACAGACTTG TTATTAAAGA 
18151 AACAGGTTAA ACAAATAGAT TGAGAGGAAA TGTGGTTCAT GTCTGAGATC 
18201 AGCAAACTTT TTTGTCCAGA AGTCCAGATA ATAAATATTT TAGCTTTGTG 
18251 GGTCATGTGG TCTCAGTTGT AGCTACTTGT CTCTGCTGCT GTACCTCAAA 
18301 AGCAGCCATG GATAATATGT AAATGAATGG GGATGACTGA TTTCCAATAA 
18351 AAACTTTATT TACAAAGATA GTTAATACAC CTTATTTGGC TTGAGGGTTA 
18401 TAGTTTGCCA TCCCCTGATT TACAATGAAT ATTAAAGTTT AATTCAAAGC 
18451 AAGTTCCTTC AAACAAACAA ACTAAACTCT AGATGATTTT GAAGATTATT 
18501 CACATCTGTG ACTCTCAGCC AGGAAGAGCT GAGTTTGGGT TGGAAAGTAG 
18551 TACTATTGGA ACATTTGTTG CCCATAAGCC TTACAATATA TGCCCCTAAG 
18601 TCTAGCCTTA GTCCAGTCTT CTAGCAAAAC TCAGTTTTCT TTCTTCTCTG 
18651 CAAACTTTCA TTCCAACATC GACCCTCTGC AGTTCAGATT GTCTTGCAGG 
18701 TCAGATTGTC TGTGTGCTGC TATGGTAGGC AGTAGCTGAG AGATGGAGCT 
18751 ACCTTAAGAT CAATTGCCAG ATAATCAGAG GTCAATTATC CCAGTGCATA 
18801 AGTAGTGTAC ATATCAATTG TTCATTTTAT AAAATTCTAA ATGAACCAGA 
18851 GGCAATAATT AAAGATGAAA TTTTGATGGT ATATTTGTAG GAAATCTACA 
18901 CAATGTTTCC CTAATTTCCC ATGTTTGTGT ATTTTAAAAC AATGTGGCAT 
18951 TATTGGTTCA TATTTTTATT TTTTAGACTT CCTTAATGCA AAACATATAC
19001 AGTTGATCCT CATTATTTGG GGATTCTGTA TTTGCAAATT TGCCTACTCA 
19051 ATAAAATTTA TCCCCAAAGT AACCCCAAAA TATATACTCA CAGTACTTTC 
19101 CCAGGCATTC ATGGACATGC ACAGAGCAGT GAAAAACTTG AGTTGCTCAG 
19151 CATGTACATT CCTAGCTAGT AGAATAAGGC AATACTCTGC CTTCTTGTTT 
19201 CAGCTCTCAT ACTATTAACT AGCAAGTATC CCTTTCAAGG TCTATTTTGT 
19251 GCCAGI I I I I GCAI I I HOT ATTTTTGTTG GTAATTTCCT TTTTAAAATG 
19301 TTCCCCAAAG GTAGTGCTGA AGTGCTGTCT AGTGTTCCTA AGTGCAAGAA 
19351 AGCCATAGCA TGCCTTATGG AGAAAATATA TGCGTTGGAT AAGCTTTGCC 
19401 CCAAATTCAA TGTTAGTGAA TCAACAGCAC ACATTAAATG AGGTGCCTTC 
19451 AAACAGAAAC AGACATAAGA CATGGTTATG TATTAATCAG TTGATGAAAG 
19501 TGTTGTAATC AGAGGCTCAC AGGAACCTAA CCCTGTTTTT CCTGTAGGAA
19551 CAATGGTTTG GTATTTGCTA ATTCAGTGTT TGCAATGAAT ATAGAACTTT 
19601 ATGGAAGATG ATTGCTGTGA ATAATGAGAA TTAACCATAT CTCTTTAAGA 
19651 GTGCATTTCT AAAGGAGAAT ATTCAGAAGG GTATTTGCAT AATTTCTTTA 
19701 CTAACAGATG CTGCCTCTCA CTGTCCTTAC ATGGTCCAGA TTCTCATGCT 
19751 GCTCCTTCCC TCTCCCCAGG AGGATTCTCT CAGAATCCTG TCATCTCCTC 
19801 CAGGGTCCTT TCTCCAAGAA AGTCTATCCT TTCACCACTA ACAGTAATTT 
19851 TGGTCTTCCT LI I I I ICTGG AGAAGTCAGC TGTTTATGCT GCTTCAGCAC 
19901 CAGACCCTCT CTTACTTTGT I I IGI I I CAT TCI I I I I CAT GTACAGTAGT 
19951 CTTAGGATTC TCATGAGCCT GTGAGCTGCT AGAAGGAAAT ACAGCAGTGC 
20001 TTACATTTAT TGCTTCTATT TTATTTTCTA TTTTCTCTTC CTGTCTTCTG 
20051 ATTGTTCTCC TTCTGTCCAC AAACATGCTC TAATTTCCCT AGTATTAAAA 
20101 ATTTTCTGTC I I I IGI IGI I CTTTTATCCT TGCTCCCTTA I I I I IACTGC 
20151 CAGAI I I I IA I I I I IATTTA TTTAI I I I IG AGATGGAGTC TCACTCTGTC 
20201 ACCCAGGCTG GGGTGCAGTG GCGCGATCTC AGCTCACTGC AACCTCCGCC 
20251 TCCCAGCTTC AAGCAATTTT CCTCTTTTAG CCTCCCAAGT AGCTGGGATT 
20301 ATGGGCACCT GCCACCATGC CTGGCTGATT TTTCTATTTT TAGTAGAGAC
20351 GGGGTTTCAC CATGTTQGCC ACACTGCTCT CTAACTGCTG ACCTCAGGTG 
20401 AACCACCCGC CTCAGCCTCC AAAAGTGCTG GGATTGCAGG TGTGAGTCAC 
20451 TGTGCCTGGC CTTTTACTGC CAGATTTTTA AAAGAATAGT CTGTGCTTTA
20501 GCTCTATTTC CTCATTTACT ACTTCTCTTT AACTCAGTCA TATATGATGT 
20551 TTTGCATAGT AAATGTCTAG TAATTTATTA AAAATGTAGA AATAGGTACT 
20601 TTTAAAATGA ATAGATCCTA CTTTAATTGA ATTTATGTTG GAGTTAGAAT 
20651 ATCTTGATTT GGATTTTAGT TCTGCTACTT CTTAATTACA TTACTTGGTA 
20701 AGGCCACTTG TGAAGTCAGT CTCTTTGGAG GAATATTATT TATCTATAAG 
20751 GCTGTTACAA TTACTGAATT TTAAAAAATG TGTATTTATT TTTTAATGTA 
20801 TTTGTTACAT TTTTAGTATT GATGTTGGGA TAGGCATTTA AGCAAGTCTA 
20851 TAACTCACCT ACATGCATAA TTTTGCCTTA ATCAGTTTAA AGCTTTCTCT 
20901 TAAATGAGAG ATTTGAAATT CATAATTTCT GTGGTTCTTA TCAGTTCTGA
20951 GTTTTATTTT TTGCCCTTTT TATTTTTTTA AAGGAAAAAT TGAGGCTTCA
21001 GAAATTGTCC AGTCTCTCCA GACACTGGGT CTGACTATTT CTGAACAACA 
21051 AGCAGAGTTG ATTCTTCAAA GGTAAGCTCT TCATGTTGGT CAACAATTGA 
21101 CTTTCACTTT AATATCCTGC ATTAGAACTC TGTGTTTGTA AGTGTGGCTT 
21151 TAAAACACCT CCCTAGTCTT CATTATGTAT ATCCAAGATC TTTTTGTCTT
21201 TTTTCCTCCC ATTCATTTTG TATGTGTACA TTTATCTAAA GTGTAAGAAT 
21251 GGGAAGTGTA AGCTCAGACT GGACTCTTTC TTTCAAGGCC TCAAAGGATA 
21301 GTGGAATGGC AGGAAGTAAG GTTTTAACTC CATAGATGAG GAGCTGAAGA 
21351 GTTTTGGTGT TGCTTTTTCT CCATTTGATT TCTAATGTGA CAGTAAAACT 
21401 CATTGATTCA AACTAAGAAG ACTAGCAGAT TCATCACATT ATTTAACCTA 
21451 GATGTGACTG GAAAAAAGGG AAATTACTAA GCTCTCCAAG CTAACAAAGA 
21501 AATACCTGTT TAAACTTTCA GAAAACAGAA ATGCAAATTT GAACCTTATT 
21551 GTCTGGGGCA ATCAGTTTGA CTATTTAAGT CAGACTTTTA TACTCTTAAT 
21601 GTTTTGTTTC ATGGGATAGA GCAGTAATCT CTGCAGCCCA GGTGCTCTCA 
21651 AATACTCTGT TGCTATAAAC ACAGGGCAGG AACTGATTTT TTATGATAAC 
21701 GTAAAACAGA AAAGGACAAT TATATTGTAT TAATATTGTT GTGAATATTT 
21751 TCAGTCCTCA CATTGTCTAA AAATCTTTCT AAATGGCTTT GTTATTGAAT 
21801 TTATCTCATT TTATATCTGT GCCAACAGCA TTTTCATCCT TTCTCTTCAT 
21851 AATTTCTTTT ACAAACAGCT GCTCAAGAGG AAGGCTCAAA GTCTCAAGGC 
21901 TGAGCACGTA ATGACTTTTG TTAGTACTAG ATGAGAAGGG CTTTCCTGAG 
21951 GAAATGAAAA CCTAAAACAT GAAAAGAAGA TAAACAGAAT TTGGACAGTG 
22001 AGATATAGAG CATATAATAT TCTGCTTCTA AAGTAATATT CTTCTAGGAA 
22051 AGTGAGGGCG TTTCCCTGGC TGTTAGGCCA GAAATCATAT TCCTATATTT 
22101 TCTTTGATAG CTTTAGGAAT AATGCAAATT CTAAGCCCAA GCTTCAGAAT 
22151 AGACTAAGAA GTATTAGCTT AGCTGCCATG ACAAAATACC ATAGGCTGGA 
22201 TGCATTAAAC AATGGAAATT TAGTTTTTCA CAGGTCTGGG AGCTGGGAAG
22251 TTTAAGATGA GAGTGCCAGC ATGGTTGGGT TGTAGTGAGG GCTCTCTTTC 
22301 TGGCTTGCAG ATAGACCCCT TCTCACTGTA TTGTCATATG GCAGAGAGAG 
22351 AGAGAGAGAG AGAGAGAGAG AGAGAGAGGG GATCTTTCTC TTGCTTTCTA 
22401 TTATAAGGCC ATAGTCCTGT TGGATCAGGG TTCCATTCTT ATGACTTTAT 
22451 TTGACTTTAC CCCCCTAAGA TGCTATCTCC AGATATAATC ACACGGTGGG 
22501 TTAGGGCCTC AACATTTGGA TTTGGGAGGG ACACAGCTCA GTCCATAGCA 
22551 AAGGATAATG CAGAGGGTTG GATATTTAAA AGTAGCTACA CAATTTTTAA
22601 TATAAATATT TTATGGTAAC I I I I I I I I I I TTTTGAGATG GAGTCTAGCT 
22651 CTGTTGCCCA GGCTGGAGCG CAATGGTGGG ATCTCAGCTC ACTGCAACCT 
22701 CCGCCTCCCA GGTTCAAGCA ATTCTCCTGC CTCAGCCTCC TGAGTAGTTG 
22751 GGACTATAGG CACGCGCCAC CACGCCTGGC TAI I I I I I I I TTAI I I I IAC 
22801 TAGAGACGGG TTTGCACCAT ATTGGTCAGG CTTGTCTCGA ACTCCTGACA 
22851 TCAGGTGATC CACCCATCTT GGCCTCCCAA AGTGCTGGGA TTACAGAAGT 
22901 GAGCCACCGC GCCTAGCCAG CAGCTTTACT GAGATGTAAT TCACATGCCA 
22951 TAAATTCACT TTTCTAAAGT ATACAATTCA GTGACTTAAA ACATTTATTT 
23001 ATTTTTAAAT TGACAGAATT ACATGTATTT ATCATGTACA ACATGATGTT 
23051 TTGAAGTATA TGTACATTGT GGAGTGACTA AGTCTAGCTA ATTAACATGA 
23101 TACATCTCAT ACTTAATGAT TTCTGTGGTG AGAACACTTT ACATCCATTC 
23151 TCTTAGTATT TTTCAAGAAT ATAATATATT ATTATTAATT GTAGTCTTCA 
23201 TGTTGTATAG TGGAGCTCTT GAACTTATTC CTCATGTCAA GCTGAAATTG
23251 TGTGTCCTTT AACACAAACC ATACCCGACT CCCAAAGTAT TCTGCTCTCT
23301 GCFTCTATGA GATTAACTTT TTCTGATTCC ACATGAGTGA GATCATGCAG
23351 TATTTATTTG TCTTTACCTG GCTTATTTCA TTCATATTGT TACAGATAAC
23401 AGGATTTCCT TCTTTTTTTA ATGGCCGAAT AG I I I ICTAT TGTATATGTA
23451 TAGCACATTT TCTCTCTTCA TGCATTGGTG GACACTTAGG TTGATTCCGT
23501 ATCTTGGCTA TCGTGAATAG TGCTATAATG AACATGGGAA TGCACATGGC
23551 TCTTTGACAT ATTGATTTCA TTTTATATAT GTGTATATAT ATATGTATAC
23601 ACACACATAC ATACAGTGGT GGGATTGCAG GATCATATGG TAGTTCTATA
23651 TTTAATTTTT AAAGGAACTC CATACTGCTT TCCATAATGG CTGTATTAGT
23701 TTAACTCCTC ACCAACAGGG TGCAAAAGTT CCCTTTTCTC TACATACTTG
23751 CCAACACTTG TTATCTTTTG TCTCTTTGGT AATAGTCATT CTAAGTGTAG
23801 TATGAGGTGA TATCTCATTG TGGCTTTTAT TTGCATTTCT GTGGTAATTA
23851 GTGATATCGA GCI 1 1 1 1 1 1 I I I I 1 1 IGTAC TTTGGCCATT TGTATGTCTT
23901 TGAAAAATGT CTATTGGGGT I I I I IGGTTG TTTATTTGAG GTTTTNNNNN
23951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
24001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
24051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
24101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
24151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
24201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
24251 NNNNNNNCCG GGGTTCCCGT CATTCTCCCT GCCTCAGCCT CCCCGAAGTA
24301 GCTGGGACTA CCAGGGCACC CGCCCACCAC GGCCCGGGCT AAI 1 1 1 I IGT
24351 ATGTTGAGTA GAGACGGGGT TTCACTGTGT TAGCCAGGAT GGTCTTGATC
24401 TCCTGGCCTC GTGATCTGCC CGCCTCGGCC TCCCAGAGTG CTAGGATTAC
24451 AGGCGTGAGC CACCGCGCCT GGCCTGATTT CTAGI I I I I I ATTATTGTGG
24501 TCGGAAAAGA AACTTGATAT GATTTCATTC TGCTTAAATT TGTTAAGACT
24551 TGTTTTGTGG CCTAACATAT GATATCCCCT GGTGCATGTT CCATGTGCAG
24601 TTGAGAAGAA TGTGTATTCT CTTGCCATTA GGTGAAATGT TTTATGTCTG
24651 ATCTGTCCAT TTGTTCTAGA GTATAGTTTA AGTCTGATGT TTCTTACTGA
24701 TTTTCTGTTG AGATGATTTG TCTATTGCTG AAGGTAGGGT GTTGAAGTCC
24751 CCTACTATTG CTGTATTGCA GTCTCTCTCT CCTTTCAGAC GTATTAATGG
24801 1 1 I I IATTTT ATTTTATTTG TTGTTGTTGT TGTTGTTGTT GTTGTTTTTG
24851 AGACGGAGTC TCACTCTGTC ACCAGGCTGG AGTGCAGTGG CAGGGTCTCG
24901 GCTCACTGCA GCCCCCGTCT CACGGTTCAA GCGATTCTCC TGCCTCAGCC
24951 TCCGGAGTCG CTGGGACTAC AGGCGCATGC CACCACGCCC AGCTAATTTT
25001 TGTAI I I I IA GTAAAGACGG GGTTTCACCA TGTTGGCCAG GATGGTCTTG
25051 ATCTCTTGAC TTCATGATCC ACCCGCCTTG GCCTCCCAAA GTGCTGGGAT
25101 TACAGGTGTG AGCCACCACC CCTGGCCAAT GTTTGGTATT TATCTTTAGG
25151 TGCTCTGATG TTGGGTTCAT ATATATTTAT AAAAAACAAT AGCTACATAA
25201 CTTATTAAGG GATATGCAAT ATAAAATATA TAAATTGTGA CACTGAAAAT
25251 TTAAAATGGG AGGAGTGGAG TAAAAGTACC TTCATATAAC TTACTATTAT
25301 ATCCTOTAT TGAATTGACC CTTTTATCAT TATATAGGAA CTTTGTTTCT
25351 CCTTTACAAC TTCTGACTTA AAGTTTGTTT TATATGATAT AAGTAAAGTT
25401 ACTCCTGCTC TCCTTTGGTT TCTGTTTCCA TGGAATATCT TTTTCCATTC
25451 CTTCACCATC AGTCTGTGTG TAI I I I IACA GATGAAATGA GTCTGTCATG
25501 GGCAGCATAT AGTTGGATCT AG I I I I I I IA ATCCACTCAG ACACTGTGTT
25551 TTTTGATTGG ATAATTTAAT CCATTCATGT TCAAGGTAAT TATTGATAAG
25601 TAAGGACTTT GTACTACCAT TTTGCTTATT GTTTCATGGT TCTTTTATAG
25651 ATCCTTTATT LI I MCI IC C TCTCTTGCTG TLI I I I I I I I GTGGTTAAGT
25701 GATTTTCTCT AGTGGTATGT TTTGATTTCT TGU I I I IAT I I I I IGTGTA
25751 TCTCCTATTG GTTTTTGGTT TGTGGTTACC AAGAGGTTAC AAAAAACATC
25801 TTAAGAGTTA TAATAGTTTA TTTTAACTTG ATAACTTAAT TTTTATTGCA
25851 AAAACCCCCC AAAACAAAAA AATCTACACT TTTACTTAAT CCCCTGAAAT
25901 TTTGAATTTT TGATGTCACA GTTTACCTCT TTTCATATTG TGTATCCCTT
25951 AAATTATTGT AGCTATTATT ACTTTTAATA Gl I I ICTCTT TCCTACTACA
26001 GATGTAAGTG ATTTGCATAC CATCATTACA GTATTATTTT GAATTTACCT
26051 GTGTACI I I I TTTTATCAGC CAGI I I I ATA CTTTCAGATG I I I I IGTGTT
26101 ACTCATTAGC ATCTTTTTCT TTCAGCTTGA GGAGCTCCTT TTACGTTTCT

26151 TATAAAATAG GTGCGGTCAT GATTATCTCC CTCAGCTATT GTTTGTCTGG

26201 GAAAGTATCT CTCCTTCATT TCTGAAGGAC ACTTTGCTGG GTACATTACC

26251 GTGGTTGGT Al I I I ICTCC TTGAACGCTT TAAATATATC ATCCCTTTCT

26301 CTCCTGACCr GTTAGGTCTC TGCTGACCAG TCTGTTTCCA ACCATATTGG

26351 GACTGTCTTA TATGTTATTT GCTTCTTATC TTTTGCTGTT TTCAGGATCC

26401 TCTCATTGTC TTTGAI I I I I GATAGTTTGA TTGTAATATG TCTTGGGGTA

26451 GTCTTGTTTG GATTGAATCT GATTAGAGAC CTTGGACTTT TCCTGCATGT

26501 AGATATTTAC CTCTTTCTCC AGGTTTGGAA AATTTTCTGT TACTGTTTCT

26551 TTAATTAAGC TTTTTACCCC TTTTATCTTC CTTTTCTCCT TCTTCAACTC

26601 CTGTGACrCA AAACTTTGCT CTTTTGATGC TGTTCCATAA ATCTTGTAAG

26651 CI I I <_ I I CAT TCATTTTCAT TCI I I I I ICT CCTCTGTGTA TTTTCAAATA 

26701 ACCTGTCTTT GAGTTCATAG I I ICI I ICI I CI ICI IGATC ACTTCTGCAG 

26751 TTGATGCTCC CATATTGCAT TTTAATTTTG TTCATTGTAT TTTTCAGCCC 

26801 CATGATTTCT GTTTGATTTT TTCTTTTATT ATTTCATCTC TTTATTACCT 

26851 TTCTCTTTGT GGTCACTCGT TATTTTCCTA ATTTCATTGA ATTGTTTCTT 

26901 TGTATTTTCT TGAAGTTTGC TGAGCTTTCT TTGAATTCTA TGTCAGTTCA 

26951 TACATCTCTG I I ICI I IAGG GATGGTGGCT GGTACTTTAT I I ICI I ICI I 

27001 TAGTGGTGTC ATTTGTTCCT GATTGTTGTT GATGTTTGTG GCCTTGTGTT 

27051 TACATCTGTG CATTTGAAGA AGTAGGCACT TATTTCAGTC TTTGCAGACT 

27101 GGCTTTGTCT GAGAATGCCC TTCAACAGTC AGCCTGTCTA GAGATTCTTT 

27151 AATATTTAAT TAAATATCTT TAATATTTTG AAGAACTTCC AAATTGTTTC 

27201 TAAAGTGGCT GCACCATTTT ATAATCCCAG CAGCAATGAA TGAAGGTTTC 

27251 AGTTTCTCCA TAGCTATATG AATACTCATT ACTGTCTGTC TTTTCATTTT 

27301 TTGAI I I I IA I I I I I I I I I I GAGAAAGGGT CTTGCTCTGT CATCCCATCT 

27351 GGAGTGCAAT GGCACAATCA TGGCTCATTG CAGCCTCAAC TTCCCTGGCT 

27401 CAATTGATCC TCTCACCTCC TGAGTACCTG GGACTACAGG CATTGTACCA 

27451 CAATGCCTGG CTAAI I I I IA TAI I I I I IGT AGAGATGTGG TTTTGCCATG 

27501 TTGCCTGGTG TATTAGTCCA TTCTCATGCT GCTATAAAGA ACTGCCTGAG 

27551 ACTGGGTAAT TTATAAAGGA AAGAGGTTTA ATTGACTCAC TTTTGCTTGG 

27601 CTGAGGAGCC CTCAGGAAAC TTACAATCAT GGTGGAAGGG GAAGCAAACA 

27651 CGTCCI ICI I CACATGATGG CAGGAAGAGC AGTGCCTAGC AAAGAGGGAA 

27701 AAAAACCCTT ATAAAATAAT CAGATCTCAT GAGAAGTTAC TCACTATCAT 

27751 GAGAACATCA GAATGAGGGT AGCCTCCTCC ATGATTCAAT TACCTCCCAC 

27801 TGGGTCCCTC ACGTGACATG TGGGGATTAT TGGAACTATA ATTCAAAATG 

27851 AGATTTGGGT GAGGACACAG CCAAACCATA TCATTTTTGC CCTGGTCCCT 

27901 CCCAAATCCC ATGTTCTCAC ATTGCAAAAC ACAATAATGC CTTTCCAGCA 

27951 GTCCCCCAGC GTCTTAACTC ATTCCAGCGT TAACCTAAAA GTCCAAGGTT 

28001 TCATCAGAGA CAAGGCAAGT CCCTTCTGCC TATAAGCCTG TAAAATCAAA 

28051 AGCAAGGTAG TTATTATACT TCCTAGATAC AATGAGGGTA CAGGCATTGA 

28101 TTAAATATAC TTGTTCCAAA TGGGAGAAAT TGGCCAAAAT GAAGGGGCTA 

28151 CAGGCCCCAA GTAAGTCCGA AATCTAGTGG AATAGTCAAA TCTTAAAGCT 

28201 CCAAAATGAT CTCCTTTGAC TCCACATCAC ACATCCAGCT CATGCTAATG 

28251 CAAGAAGTGG GCTCCCATGG CCTTGGGCAT CTGCACTCCT GTGGCTTTTC 

28301 AGGGTACAGA CCCCCTTCTG GCTCTTTTCA CAGGCTGGCG TTGAGTGTCT

28351 GTGGCTTTTC CAGGTGCATG GTGCAAGCTG TCGGTGGATC TACTATTCTG 

28401 GGTACTGGAG GATGGTGGCC CTCTTTTCAC AGCTCCACTA GGCAGTGCTC 

28451 CAGTGGGGAC TCTGTGTGAA GGCTCCAACC CCACATTTCC CTTCTGCACT 

28501 GCCCTAGCGG AGGTTCTCCT CAAGGGCTCC ACCCCTGCAG CAAACTTCTG 

28551 TCTGGACATC CAGGCATTTC CATACATCCT CTGAAATCTA GGCAGAGGAT 

28601 CTCAAACCTT AATTCTTATC TTCTGTGTAC CCGCAGACTC AACACCTTGT 

28651 GGAAGCTGCC AGGGCTTGGG GCTTGCACCT TCTGAAGCCA TGGCCTGAGC 

28701 TGTACCTTGG CTCCTTTTAG CCATGGCTGG GATGCAGGGC ACCAAGTCCT 

28751 GAGACTGCAC AAAGCAGCAA GGCCCTGGGC CTGGCCCAGG AAACCATTTT 

28801 TTCCTCCTGG GCCTCTGGGC CTATGATGGG AGGGCCCTTC CTGAAGACCT 

28851 CTGAAGTGCC CTGGAGGCAT TTTCCCCATT GTCTTAGTGA TTAACATTTC 

28901 ACTCCTTGTT TCTTATGCAG ATTTCTGCAG CTGGCTTGAA I I I I I ICCTC 

28951 AGAAAATAGA I I I I ICI I I I CTGTCACATC ATCAGGGTGC AAATTTGACA 
29001 AACTTTTCTC CTCTGCTTCC TGTGGAATGC TTTGCCACTT AGAAATTTCT
29051 TCTGCCTGAT ACCCCAAATC ATCTCTGTA GGTTCAAAGT TCCACAGATC 
29101 TCTAGGGCAG GGGCAAAAAG CCACCAGTCT CTTTGCTATA GCATAACAAG 
29151 AGTCATCTTT GCTCCAGTTC CCAACAAGTT CCTCATCTCC ATCTGAGATC 
29201 ATCTCAGCCT GGACTTCATT GCCCATATTA CTGTCAGCAT TTTGGTCAAA 
29251 GCAATTCAAC AAGTCTCTGG GAACTTACAA ACTTTCCCAC CTU I I I IGT 
29301 CTTCTGAGCT CrCCAAATTT TTAAGAAGTT CCAAACTTTC CCAGTCTTCT 
29351 TCTGAACCTT CCTAACTGTT CCAACCTCTG CCTGTTACCC AGTTCCAAAG 
29401 TCAGTTCCAT Al I I I IGGGT ATCCTTATAG TAGCACCCAA CTCCTAGTAC 
29451 CAATTTACTG TATTAGTTCA TTCrCACGCT GCTATAAAGA ACCACCTGAG 
29501 AATGGGTATT TTATAAAGGA AAGAGGTTTA ATTGACTCAC AGTTTCGCGT 
29551 GGCTGGGGAG GCCTCAGATA ACTTACAGCC ATAGCAGAAA GGGAAGCAAA 
29601 CATGTCCTTC ACATGGTGGC AGGAAGAAGA AGTGCTGAGC AAAGAGGGAA 
29651 AAGCCCTATA AAACCATCAT ATCTCGTGAG AACTCACTCA CTATCATGAG 
29701 AACAGCAGCA TGGGGTTGAC CACCCCCCAT AATTCAATTA CCTCCCACCA 
29751 GCTGTCTCCC GTGACACATG GAAATTATGG GAACTACAAC TCAAGATGAG 
29801 ATTTGGGTGG GGACACAGCC AAACCATATC ATCTAGGCTG GTATCGAAAT 
29851 CCTGGGCTCA AGCAATCCAC CCACOTGCC CTACCAAAGT GCTGGGATTA 
29901 CAGGCATGAG CCACCATATC TGAACTGTCT TTTGATTTCT TTTGATTTTA 
29951 ACCATCCATT GTTTCTGCTT CTCTAGATAA CCCTGACTAA TATATAATTG 
30001 GTATGAAGTG ATATCTCATG GCTTTGATTT ATA I 1 1 CI I I CATGGCTAGT 
30051 GAL I I I I I I I GTACTTTTGG GATATTGTTA TTATTATTAT TATTATTACT 
30101 AGTGTTTATA CTTCTTCAGT AAAAGTGTTA GAAACAATTT TTAAAGGCAG 
30151 AATGTGACCA GAGTTTCCTG TAGTTATATA ACCATCATGG ACCTTCCCTC 
30201 AAGTGCTAAG CCATTAGTGT TACTCATGTC ACTCCAAATG TCAGCTTGTT 
30251 TTCTTCCATT TCACTGTCTC TTTGTGTCCC AAACTTGAAT TCATGGGAAA 
30301 AACATCTGAA TGGTGCTTAA TATGGTTTGG ATATTTGTCC CCTCCAAATC 
30351 TCATGTTGAA ATATGACCTC CAGTGTTGGA AGTAGGGACT ACTTGGGTCA 
30401 CGAGAGTGGA TCCTTCATTA ATGGCTTGGT AATAAGTGAA CTCTATTAGT 
30451 TCATGAAAGC TGGTTGTTGA TAAGAGCCTG GCATCTGATT TCTCTTGTCC 
30501 TTCTCTCACC ATCTGACACA CTTGCTCACC I I I I I I C I IC AGCCATGAGT 
30551 AAAAGCTTCC TGAGGTCTCA CCAGAAACTG AGCAGATGTT GGTGCCATGC 
30601 TTGTACAGTC TGTAGAACTG TGAGCCAAAT AAGCCTCTTT TCTTTATAAA 
30651 TTACGGAGTC TCAGGTGTTC GTTTAAAACA ACACAAAACA GACTAACACA 
30701 GTGTTGATTG AAACAGCTGT GACTGGGTCA TCAGGGTGTA AGAGAGGAGT 
30751 CACTGAGTTG AAATATAGCC TCCTACTTAC ACCTGTTCAG TAGAAGCTGT 
30801 AGATATGAAG TAGCTGAAGC AGGCATTCCC TCTGAAACAT GTGTTTCACA 
30851 TATGTCATAA TTATCTTCTG CTCTCATTTT TCTTTTAGGC TTTTGTCTCC 
30901 ATCTCATTTC CCCTGTTTAC TCTCATTTTC ATATCTTTAC ATTTCTTTCT 
30951 CCAGAATTGT TCAGAAGCTT GGAACCCTTC ACTCCAGTTA TTCTTTGACT 
31001 ATGCAATTTG TTTCTGTGCT TCATGGCACT TATGGTTTGT AATCCTTGAC 
31051 TTGTTTGTAT AGCTCAGTGG TTAGGAGTAC AGTTTGGAGT TAGAATGCCT 
31101 GGGTTGAAAC TCTTAATTCT ACTCTACTTA CTAGTCTTGT GACTATAACA 
31151 AAATTCTTAG CCTCTCTTTG TCTGTAAAAT GGAGAGTATA GTAAATACAT 
31201 GGGCTTGTTT TAAGGATTAA ATGAGTTAAC ATGTGAAATA CTTAGAACAA 
31251 TGCCTGGGAA ATGCTCAATG AATATTGAGT ATTGCTTGCT TTTGTTTAGT 
31301 GCCATGCCTG TTGTTCCCAC TGAGGGCACA GACCATGTGT ATCTGGTTAA 
31351 CAGTTCTATG TCCACCACGT TGCAATAATG GACTCTCAGA AAATATTGAA 
31401 GAATATGTTA AAGAATGAGT AGAATTATGC TACTGAAAAG GGTGAGTGGA 
31451 AGGTAGGTAG GGGAAAGGAC ATATACAGCC CTGGAGGCAG CATATATGGG 
31501 GAATGGGTCA CACAGTGTTT CTTGGTACTC TCTAGACCAT AGTGGGCCAC 
31551 CTCTTAGCTA GTGGCCTATG GATTATTTCA GCAGTCTGTT GGAAACATCC 
31601 ATGAATATGA TAATAATGAC CCATTTGTGG GTTCTAAGAA AAAGGACAAC 
31651 TACAATACTA GACAATAATA GTATGTAAGT TAGGAGGGAA GGGGATGATT 
31701 TGTATTAAAC TGTTCTAAAA TTCTTACCTT ATTTAGGATG ATGGGGTCAG 
31751 ACATTAACTT TAGACTTTGT TATATATATG TGGTAAAATT TCAAGGTAAA 
31801 CCATTGAAAC TGTAGTAGTT GAGTATATAA CTTCCAAATC AGGGGGGAAA 
31851 GAAATGGAAT AAGAAAATAA ATACATAAAC ATAAGATTGA AACAATCCAA 
31901 TGAAGAGTAG AGAGAAGAGG GAAAAACATA GAAAGAATGA GATAATTAGA
31951 AAGCAATAGG TAAGATGTGA GAAATAAATT CAAGTACAGT AAAACTCCAC 
32001 TAAAATGTGC CCTGCAGTAA TGTTGGGGCA TGATTTCCCT TCATCCCCAT 
32051 TCTCAAATGG GGCAGCCTAA ATAGCGTTCT TATCCTGTTT CCCTGGGGGT 
32101 TTGAGGTGGG TGACGAGTAA GTTAGAAGAT AATCACOTC TGATCAGTTA 
32151 GGACTTTCTC AGTTTAGTCT TCAATTAATA AAAATTAATG TAAATTTCAT 
32201 CAGAAGGCAG AGATTGTCAG ATGAAAGAAC AAGCAAAATA AAAGTCTTAC 
32251 TGAAAAAAAG CTGGGGTAGC TATGTTAATA TCAACTGTTA ATTATTATTA 
32301 ATAATCTATT AATAATAGAT TATATAGTAA AAACATTAAT AAAAATAGAG 
32351 TGTCACTACA TTTTAAAATT CAGTATGAGG ATATACAATT TTTAAGCTGG 
32401 TTGATAAAAT TCTGGGGATT AATTGGCAAA TCCATCATAG TGGTGAGAGA 
32451 TTTTAACACA ATTCTTCCTG TATTTGATAG GTCAAGCAGA GAAAAACTTT 
32501 AGTGAAGACA AAAACTTCTA AATACATAAG CTTGATTTAA TGGGCATGTA 
32551 ATAGGACCTA GCATCAAAAA ATTAGAAAAA ATATTTTTTC TTAGGTATTT
32601 ATGGAACATG TATAAAAATT GATTTCGTAG TAGGCCATAA AGCCAGGTTC 
32651 AACACATTTC AAAGAACTGG TATCACAAGA ACTGCTTTCT CTGACCACTA 
32701 TGCATTAAAA TAGAAGTTAA TTACAGACAT AAATTATAAA AATGCCAATA 
32751 TTTTAAAGTG TGATATACAC TTCTCAACTT ATGGGTCAAA GGAAATCGTA 
32801 AGTGGAAATT CAAGGACACG TTGACTTGAA AACATTAAAA CTTATGGAAT 
32851 ATTTCTAAGA TGGAACTTGT ATGAATTTTA TAGTCTGAAA GCTTTTATTA 
32901 GAAAAGAATT AAGTCTCAAA ATTAATGTGC TAAGTTAGGG GAGAGAAAAT 
32951 GGAATAATCT CGAAGAAGGT AGGAGGAAGG AGATAATAAA GAATATATAG 
33001 CAAAGATGCA GTAACAGGAT CAACAAAGCC AGAAACTGTT GGAAAAGACA 
33051 AGCCTCTGGA AAGATTGATG AAGAAAAAAG AGAAATGAGA TGTAAATAAA 
33101 TCATGTTCAG TTATAAATAG GCACATAAGG ACTTTTAAAA AACTAATAAA 
33151 ATAATATGAA TCATTAATGC CAATAAATTT GAAAACAGAC AAAGTAGGTG 
33201 AATTTCTAGA AAAATATAAC TTACTGGGAC TGAATGAAGA AGCAACAGCT 
33251 TATAGTACCT AAGCAATTGA AGAGATTGGG TCAGTAATTT AAAATTTTCT 
33301 CATAAACAAA ACGTTAGCCC CAGATGGTTC TTGCAAATGA TTAAAGAACA 
33351 GATGTACAAA CATTTCCAGA GTGTAGAAGT ACACTGTCCT ATCCTTTCTA 
33401 GGAGATCATT ATAACACCAA AAGCAGACAG TATATGAAAC AGGGAAATTA 
33451 GAGGCCAAGA TACCTATGAC TTATATGTAA AAATTTAAAG AAAATATTAG 
33501 CAAACTGAAT CAGCCATTTT AAAAAATATA CCACAATCAA TGCATTCATA 
33551 AGAGCAGCTT AACAAAATTT GTTAGAAGGC ATTAAAGAAG ACTCAGTATA 
33601 GAAAAGATGT ACCTTCTCTC CAAATTGGTG ATAGAGATTC AATGCCATTA 
33651 AAAAAACCCA CCTGGTTTTT TTGAGGAACT TGTCAAGCTG AGTCTCAAAT
33701 TTATATCAAA GAGCAAAGGC CTAAGAATAT CCAGGACATT CCTGAAGAAC 
33751 TGTAAGGAGC CAGGGGCCTG CCCTATCAGA TACCAAGGGT TGTTATTAAG 
33801 CCATAACCAA GTCAGTGCTG TTTCTACAGA AACAGACAAG TTAACAAGTG 
33851 AAACATAATA GAGAGCCCAG AAACAGACCC ATCCATATTT TGGATTTGTC 
33901 ACGTGAAAGA AGTAGCTTTG CAAAACTTTG GGAAAAGGAG AGTGTGTGCA 
33951 ATAGATGATG CTCGTGCTCA TGCAGACAAA AAGGAAATTG GGATACCTGC 
34001 CTCTTACCGT ACACAAACAC CAACCTAAAC GTGAAAGTTA AACTATAACA 
34051 GCTTGAGGTG GTGGGGAAGA AATATCTTTA TCTCAGTGTA GGGAAGAATT 
34101 TATTTTAAAA AGAAGACACA AAAGGCCATA CATAGGAATG AAAAGATTGA 
34151 ATTCAGCTGC ATTAAAAAGA TTAAATTCAG CTGCGTTAAA ATCAAGAGCA 
34201 TCTGTACTTG GACAGCATAG AGTGGAAAGA CAAAGAGAAG GTATTTGCCA 
34251 GCTTATAACT TGAAGGATTA GAATGAATGA TATAAAGAAC TATGTAAATA 
34301 AGAAAAAGAC ATACAACCGG TTAGAAAAAC GGGCAAAGAC ATGAACAGCA 
34351 TATTTCACGT GAAGGAAACA GCGGTAGCAA ATGAACATGG TAAGAGATGC 
34401 TCAACACGTT TAGTAATTTG AAGGGAAATG CAAGTTATAC CCACAGCAAG 
34451 ACTATCTTAT CTAGGAAGTT TGTCAATACC CTAAATGTTC TGTGGTTTTA 
34501 AGCTACAGAG TTTGTAATTC ATTTATTTAT TCAATAAATA CTCAGTGGCA 
34551 GGCACTGTTT TAGAAACCTT GGTTATAACT TTGAATGAAA TTAAAAAAAA 
34601 TCCTTGCCTT GTGGAGGATG CTTATGTGTG GGGAGTTGGG TGGTGGGGTC 
34651 AAACAACAAT TACATTAAAA TAGAAAATAG TGACATAAAT AAACCTATAA 
34701 ATATTGCAAC CCAGAGTTAT ATTATAAATG TAAGTAGTGA CTAGGACTCT 
34751 CATGCAGATA TACCTCTGTG CTGGGACAAA TGAAAGTTTA AGTGTAATTT 
34801 CCCATATGCA ACTCAAAATA AAAAGTGACA CTAGAAAACA CAATAATGAA
34851 TATCTGAAAA TTGCATTTTA TTTGACTGCC ATCCTTTTGC ATCATTTTCA 
34901 TACTAATTAT AGAATAAAAT TTGTAGGATG CACCAAAGCT TTTTTTAGAG
34951 ACATCCATTA ATTCAATAAA TAAATGAGCA CCTTCTTTGT GCCAGCAGCT 
35001 GTAAGAGGTG GCCCAAGGAA QGGAATAAAA CAGTCAAAAT CCTGGTACAC 
35051 TCAGAGTTTC TCTTAGGAGA AAACAGATAC AAATGGCATT AATTACCAAG 
35101 AAACTTGTAA AACAAGCCAA ATATTAATGA TAAATATTTG AGTACAGTAT 
35151 GTTAATTTTA AGATTGAAAA TGAGGTGCCA GGATTTCTTA AGACTCAAAG 
35201 GCGAAGATGG CTGAATAGGA ACAGCTCTGG TCTACAGCTC CCAGCGTGAG 
35251 CGACGCAGAA GACGCATGAT TGCTGCATTT CCATCTGAGG TACCGGGTTC 
35301 ATCTCACTAG GGAGTGCCAG ACAGTGGGCG CAGGTCAGTG GGTGTGTGCA 
35351 CCGTGCGCGA GCTGAAGCAG GGCGAGGCAT TGCCTCACTC GGGAAGTGCA 
35401 AGGGGTCAGG GAGTTCCCTT TCCTAGTCAA AGAAAGGGGT GACAGATGGC 
35451 ACCTGGAAAA TCGGGTCACT CCCACCTGAA TACTGCACTT TTCTCACGGG 
35501 CTTAAAAAAT GGCGCACCAG GAGATTATAT CCTGCACCTG GCTCGGAGGG 
35551 TCCTACACCC ACGGAGTCTC GCTGATTGCT AGCACAGCAG TCTGAGATCA 
35601 AACTGCAAGG CGGCGGCGAG GCTGGGGGAG GGGCACCCGC CATTGCCCAG 
35651 GCTTGCTTAG GTAAACAAAG CAGCCGGGAA GCTCAAACTG GGTGGAGCCC 
35701 ACCACAGCTC AAGGAGGCCT GCCTGCCTCT GTAGGCTCCA CCTCTGGGGG 
35751 CAGGGCACAG ACAAACAAAA AGACAGCAGT AACCTCTGCA GACTTAAATG 
35801 TCCCTGTCTG ACAGCTTTGA AGAGAGCAGT GGTTCTCCCA GCACGCAGCT 
35851 GGAGATCTGA GAACGGGCAG ACTGCCTCCT CAAGTGGGTC CCTGACCCCT 
35901 GACGCCCGAG CAGCCTAACT GGGAGGCACC CCCCAGCAGG GGCACACTGA 
35951 CACCTCACAC AGCCGGTTAC TCCAACAGAC CTGCAGCTGA GGGTCCTGTC 
36001 TGTTAGAAGG AAAACTAACA AACAGAAAGG ACATCCACAC CAAAAACCCA 
36051 TCTGTACATC ACCATCATCA AAGACCAAAA GTAGATAAAA CCACAAAGAT 
36101 GGGGAAAAAA CAGAGCAGAA AAACTGGAAA CTCTAAAAAG CAGAGTGCCT 
36151 CTCCTCCTCC AAAGGAACGC TGTTCCTCAC CAGCAACGGA ACAAAGCTGG 
36201 ATGGAGAATG ACTCTGACGA GCTGAGAGAA GGCTTCAGAC GATCAAATTA 
36251 CTCTGAGCTA TGGGAGGACA TTCAAACCAA AGGCAAAGAA GTTGAAAACT 
36301 TTGAAAAAAA TGTAGAAGAA TGTATAACTA GAATAACCAA TACAGAGAAG 
36351 TGCTTAAAGG AGCTGATGGA GCTGAAAACC AAGGCTCGAG AACTACATGA 
36401 AGAATGCAGA AGCCTCAGGA GCTGATGCGA TCAACTGGAA GAAAGGGTAT 
36451 CAGCGATGGA AGATGAAATG AATGAAATGA AGCGAGAAGG GAAGTTTAGA 
36501 GAAAAAAGAA TAAAAAGAAA CGAGCAAAGC CTCCAAGAAA TATGGGACTA 
36551 TGTGAAAAGA CCAAATCTAT GTCTGATTGG TGTACCTGAA AGTGACGGGG 
36601 AGAATGGAAC CAAGTTGGAA AACACTCTGC AGGATATTAT CCAGGAGAAC 
36651 TTCCCCAATC TAGCAAGGCA GGCCAACATT CAGATTCAGG AAATACAGAG 
36701 AACGCCACAA AGATACTCCT TGAGAAGAGC AACTCCAAGA CACATAATTG 
36751 TCAGATTCAC CAAAGTTGAA ATGAAGGAAA AAATGTTAAG GGCAGCCAGA 
36801 GAGAAAGGTC GGGTTACCCT CAAATGGAAG CCCATCAGAC TAACAGCGGA 
36851 TCTCTTGGCA GAAACTCTAC AAACCAGAAG AGAGTGGGGG CCAATATTCA 
36901 ACATTCTTAA AGAAAAGAAT TTTCAACCCA GAATTTCATA TCCAGCCAAA 
36951 CTAAGCTTCA TAAGTGAAGG AGAAATAAAA TCCTTTACAG ACAAGCAAAT 
37001 GCTGAGAGAT TTTGTCACCA CCAGGCCTGC CCTAAAAGAG TTCCTGAAGG 
37051 AAGTGCTTAA CTTGGAAAGG AACAATCAGT ACCAGCCGCT GCAAAATCAT 
37101 GCCAAAATGT AAAGACCGTC GAGACTAGGA AGAAACTGCA TTAACAAACG 
37151 AGCAAAATAA CCAGCTAACA TCATAATGAC AGGATCAAAT TCACACATAA 
37201 CAATATTAAC TTTAAATGTA AATGGACTAA ATGCTCCAAT TGAAAGACAC 
37251 AGACTGGCAA ATTGGATACA GAGTCAAGAC CCATCAGTGT GCTGTATTAA 
37301 GGAAACCCAT CTCACATGTA GAGACACACA TAGGCTCAAA ATAAAAGGAT 
37351 GGAGGAAGAT CTACCAAGCA AATGGAAAAC AAAAAAAGAC AGGGGTTGCA 
37401 ATCCTAGTCT CTGATAAAAC AGACTTTAAA CCAACAAAGA TCAGAAGAGA 
37451 CAAAGAAGGC CATTACATAA TGGTAAAGGG ATCAATTCAA CAAGAAGAGC 
37501 TAACTATCCT AAATATATAT GCACCCAATA CAGGAGCACC CAGATTCATA 
37551 AAGCAAGTCC TGAGTGACCT ACAAAGAGAC TTAAACTCCC ACACATTAAT 
37601 AATGGGAGAC TTTCACACCC CACTGTCAAC ATTAGACAGA CCAATGAGAC 
37651 AGAAAGTCAA CAAGGATACC CAGGAATTGA ACTCAGCTCT GCACCAAGCA 
37701 GACCTAATAC ACATCTACAG AACTCTGCAC CCCAAATCAA CAGAATATAC
37751 Al I I I I I ICA GCACCACACC ACQGCTATTC CAAAATTGAC CACATACTTG 
37801 GAAGTAAAGC ACTCCTCACC AAATGTAAAA GAACAGAAAT TATAGCAAAC 
37851 TATCTCTCAG ACCACAGTGC AATCAAACTA GAACTCAGGA TTAAGAATCT 
37901 CACTCAAAAC CGCTCAACTA CATGGAAACT GAACAACCTG CTCCTGAATC 
37951 ACTACTGGGT ACATAACGAA ATGAAGGCAG AAATAAAGAC GCTCTTTGAA 
38001 ACCAACAAGA ACAAAGACAC AACATACCAG AATCTCTGGG ACGCATTCAA 
38051 AGCAGTGTGT AGAGGGAAAT TTATAGCACT AAATGCCCAC AAGAGAAAGC 
38101 AGGAAAGATC CAAAATTGAC ACCCTAACAT CACAATTAAA AGAACTAGAA 
38151 AAGCAAGAGC AAACACATTC AAAAGCTAGC AGAAGGCAAG AAATAACTAA 
38201 AATCAGAGCA GAACTGAAGG AAATAGAGAC ACAAAAAACC CTTCAAAAAA 
38251 TTAATGAATC CAGGAGCTGG TTGTTTTTGA AAGGATCAAC AAAATTGATA 
38301 GACCGCTAGC AAGACTAATA AAGAAAAAAA GAGAGAAGAA TCAAATAGAC 
38351 ACAATAAAAA ATGATAAAGG GGATATCACC ACCAATCCCA CAGAAATACA 
38401 AACTACCATC AGAGAATACT ACAAACACCT CTATGCAAAT AAACTAGAAA 
38451 ATCTAGAAGA AATGGATAAA TTCCTCGACA CATACACCCT CCCAAGACTA 
38501 AACCAGGAAG AAGTTGAATT TCTGAATAGA CCAATAACAG GATCTGAAAT 
38551 TGTGGCAATA ATCAATAGCT TACCAACCAA AAAGAGTCCA GGACCAGATG 
38601 GATTCACAGC CGAATTCTAC CAGAGGTACA AGGAGGAACT GGTACCATTC 
38651 CTTCTGAAAC TATTCCAATC AATAGAAAAA GAGGGAATCC TCCCTAACTC 
38701 ATTTTATGAG GCCAGCATCA TCCTGATACC AAAGCCAGGC AGAGACACAA 
38751 CAAAAAAAGA GAATTTTAGA CCAATATCCT TGATGAACAT TGATGCAAAA 
38801 ATCCTCAATA AAATACTGGC AAACTGAATC CAGCAGCACA TCAAAAAGCT 
38851 TATCCACCAT GATCAAGTGG GCTTCATCCC TGGGATGCAA GGCTGGTTCA 
38901 ATATACGCAA ATCAGTAAAT GTAATCCAGC ATATAAACAG AACCAAAGAC 
38951 AAAAACCACA TGATTATCTC AATAGATGCA GAAAAAGCCT TTGACAAAAT 
39001 TCAACAACAC TTCATGCTAA AAACTTTCAA TAAATTAGGT ATTGATGGGA 
39051 TGTATCTCAA AATAATAACA GCTATCTATG ACAAACCCAC AGCCAATATC 
39101 ATACTGACTG GGTAAAAACT GGAAGCATTC CCTTTGAAAA CTGGCACAAG 
39151 ACAGGGATGC CCTCTCTCAC CACTCCTATT CGACATAGTG TTGGAAGTTC 
39201 TGGCCAGGGC AGTTAGGCAG GAGAAGGAAA TAAAGGGTAT TCAATTAGGA 
39251 AAAGAGGAAG TCAAATTGTC CCTGTTTGCA GACGACATGA TTGTATATCT 
39301 AGAAAACCCC ATTGTCTCAG CCCAAAATCT CCTTAAGCTG ATAAGCAACT 
39351 TCAGCAAAGT CTCAGGATAC AAAATCAATG TACAAAAATC ACAAGCATTC 
39401 TTATACACCA GCAACAGACA GAGAGCCAAA TCATGAGTGA ACTCCCGTTC 
39451 ACAATTGCTA CAAAGAGAAT AAAATACCTA GGAATCCAAC TTACAAGGGA 
39501 TGTGAAGGAC CTCTTCAAGG AGAACTGCAA ACCACTGCTT AATGAAATAA 
39551 AAGAGGATAC AAACAAATGG AAGAACATTC CATGCTCATG GGTAGGAAGA 
39601 ATCAGTATCG TGAAAATGGC CATACTGCCC AAGGCAATTT ACAGATTCAA 
39651 TGCCATCCCC ATCAAGCTAC CAATGACTTT CTTCACAGAA TTGGAAAAAA 
39701 CTACTTTAAA GTTCATATGG AACCAAAAAA GAGCCCGCAT TGCCAAGTCA 
39751 ATCCTAAGCC AAAAGAACAA AGCTGGAGGC ATCATGCTAC CTGACTTCAA 
39801 ACTATACTAC AAGGCTACAG TAACCAAACC AGCATGGTAC TGGTACCAAA 
39851 ACAGAGATAT AGACCAATGG AACAGAACAG AGCCCTCAGA AATAACGCCG 
39901 CACATCTACA ACTATCTGAT CTTTGACAAA CCTGAGAAAA ACAAGCAATG 
39951 GGGAAAGGAT TCCCTATTTA ATAAATGGTG CTGGGAAAAC TGGCTAGCCA 
40001 TATGTAGAAA GCTGAAACTG GATCCCTTCC TTACACCTTA TACAAAAATC 
40051 AATTCAAGAT GGATTAAAGA CTTAAACGTT AGACCTAAAA CCATAAAACC 
40101 CCTAGAAGAA AACCTAGGCA TTACCATTCA GGACATAGGC ATGGGCAAGG 
40151 ACTTCATGTC TAAAACACCA AAAGCAATGG CAACAAAAGC CAAAATTGAC 
40201 AAATGGGATC TAATTAAACT AAAGAGCTTC TGCACAGCAA AAGAAACTAC 
40251 TATCAGAGTG AACAGGCAAC CTCCAAAATG GGAGAAAATT TTTGCAACCT 
40301 ACTCATCTGA CAAAGGGCTA ATATCCAGAA TCTACAATGA ACTCAAACAA 
40351 ATTTACAAGA AAAAAAACAA ACAACCCTAT CAAAAAGTGG GTGAAGGACA 
40401 TGAACAGACA CTTCTCGAAA GAAGACATTT ATGCAGCCAA AAAACACATG 
40451 AAAAAATGCT CACCATCACT GGCCATCAGA GAAATGCAAA TCAAAACCAC 
40501 AATGAGATAC CATCTCACAC CAGTTAGAAT GGCAATCATT AAAAAGTCAG 
40551 GAAACAACAG GTGCTGGAGA GGATGTGGAG AAATAGGAAC ACTTTTACAC 
40601 TGTTGGTGGG ACTGTAAACT AGTTCAACCC TTGTGGAAGT CAGTGTGGCA
40651 ATTCCTCAGG GATCTAGAAC TAGAAATATC ATTTCACCCA GCCATCCCAT 
40701 TACTGGGTAT ATACCCAAAG GACTATAAAT CATGCTGCTA TAAAGACACA 
40751 TGCACATGTA TGTTTATTGT GGCACTATTC ACAATAGCAA AGACTTGGAA 
40801 CCAAGCCAAA TGTCCAACAA TGATAGACTG GATTAAGAAA ATGTGGCACA 
40851 TTTACACCAT GGAATACTAT GCAGCCATAA AAGATGAGTT CATGTCTTTT 
40901 GTAGGGACAT GGATGAAATT GGAAATCATC ATTCTCAGTA AACTATCACA 
40951 AGAACAAAAA ACCAAACACC GCATATTCTC ACTCATAGGT GGGAATTGAA 
41001 CAGTGAGAAC ACATGGACAC AGGAAGGGGA ACATCACACT CTGGGGACTG
41051 TTGTGGGGTG GGGGGAGGGG GAGGGATGGC ATTGGGAGAT ATACCTAATG 
41101 CTAGATGACG AGTTAGTGGG TGCAGCGCAC CAGCAAGGCA CATGTATACA 
41151 TATGTAACTA ACCTGCACAT TGTGCACATG TACCCTAAAA CTTAAAGTAT 
41201 AATAATAAAA AAAAAAGACT CAAAGGCACA GTCACTGACA GTTTGATTTT 
41251 TTATAATAGC TGTTAATTTT CCTAACTTCG AGGAAGTTGA TAGCATGTTT 
41301 TGAGTATATT TCAAAACTAC ATTCAAATGT TGCAATAGAA CATTAAGAAT 
41351 TATCTTCATG ATCCACTAAG TGCATGAAAA AAATGGATAA TGAATCTATT 
41401 CATTACCATC GTTTAATATT TTATCTTCAA GTTTTTGTGT TTTGTAGCTC
41451 ATTGGCAGAG TTTGACAGAG TGCTGAAAGT ATTCTTTAGT GAGCTGGCTG 
41501 TAATTTTTGG GCCCATTTTT ATCTAGATAA TTAAAACTAT CTGACAGGAC
41551 CATAAAATGC TTGCTGCCAT TTCCAACAAC CTATATTTGT GGATGGGGTT 
41601 TTTTAATTTA ATGAGAATAT TATGTTAGAA AAGAAACTGT CATTCTGTAA 
41651 AGTGGCCAAT AATGTTAGTT TTATTTATCA ATTTAGTTTT GTACTTTGAT 
41701 CATTTTTTTA AAATTTCAGC ATTGATGTTG ATGGGACAAT GACAGTGGAC
41751 TGGAATGAAT GGAGAGACTA CTTCTTATTT AATCCTGTTA CAGACATTGA 
41801 GGAAATTATC CGTTTCTGGA AACATTCTAC AGTAAGTCTA CTTTATGTAT 
41851 TTATACTTAT TTGGAGCTAT AAACCATAGG TACAGTTATC ACCCAAGAAC 
41901 ACTCTGTAAC ACTTATGGGC CAGGATACCT GAGTCCCAGT AGCTCCTTAA 
41951 CCTGTAGAGT TCTATTTATT CTATTAGGCA TAGATTTATA GAGTATTAAA
42001 CAAAAAAAAA CAGCTCTCCC TCTCCCTCTC CCTCTCTCTC CCCCTCCCCA 
42051 CGGTCTCCCT CTCCCTCTCT TTCCACGGTC TCCCTCTGAT GCCGAGCCAA 
42101 AGCTGGACTG TACTGCTGCC ATCTCGGCTC ACTGCAACCT CCCTGCCTGA 
42151 TTCTCCTGCC TCAGCCTGCC GAGTGCCTGC GATTGCAGGC GCGCACCGCC 
42201 ACGCCTGACT GTTTTTCGTA TTTTTTTGGT GGAGACGGGG TTTCGCTATG
42251 TTGGCCGGGC TGGTCTCCAG CTCCTGACCG CGAGTGATCC ACCAGCCTCG 
42301 GCCTCCCGAG GTGCTGGGAT TGCAGACGGA GTCTCGTTCA CTCAGTGCTC 
42351 AATGGTGCCC AGGCTGGGGT GCAGTGGCAT GATCTCGGCT CGCTACAACC 
42401 TCCACCTCCC AGCCGCCTGC CTTGGCCTCC CAAAGTGCCA AGATTGCAGC 
42451 CTCTGCCCAG CCGCCACCCC GTCTGGGAAG TGAGGAGCGT CTCTGCCTGG 
42501 CCGCCCATCG TCTGGGATAT GAGGAGCCCC TCTGCCTGGC TGCCCAGTCT 
42551 GGAAAGTGAG GAGTGTCTCT GCCCGGCCGC CATCCTGTCT AGGAAGTGAG 
42601 CGTCTCTGCC CGGCCGCCCA TCGTCTGGGA TGTGAGGAGC CCCTCTGCCT 
42651 GGCTGCCCAG TCTGGAAAGT GAGGAGCGCC TCTTCCCGGC CGCCATCCCA 
42701 TCTAGGAAGT GAGGAGCGTC TCTGCCCGGC CGCCCATCGT CTGAGATGTG 
42751 GGGAGCGCCT CTGCCCCGCC GCCCGGTCTG GGATGTGAGG AGCGCCTCTG 
42801 CTCGGCCGCC CCGTCTGAGA AGTGAGGAGA CCCTCCGCCC GGCAGCCGCC 
42851 CCGTCTGGGA AGTGAGGAGC GTCTCCGCCC GGCAGCCACC CTGTCCGGGA 
42901 GGGAGGTGGA GGGGTCAGCC CCCCGCCCGG CCAGCCACCC CATCCGGGAG 
42951 GTGAGGGGTG CCTCTGCCCG GCCGCCCCTA CAGGGAAGTG AGGAGCCCCT 
43001 CTGCCCGGCC ACCACCCCAT CTGGGAGGTG TACCCAACAG CTCATTGAGA 
43051 ACGGGCCATG ATGACAATGG CGGTTTTGTG GAATAGAAAA AGGGGAGAGG 
43101 TGGGGAAAAG ATTGAGAAAT CGGATGGTTG CTGTGTCTGT GTAGAAAGAG 
43151 GTAGACATGG GAGACTTTTC ATTTTGTTCT GTACTAAGAA AAATTCTTCT 
43201 GCCTTGGGAT CCTGTTGATC TATGACCTTA CCCCCAACCC TGTGCTCTCT 
43251 GAAACATGTG CTGTGTCCAC TCAGGGTTAA ATGGATTAAG GGCGGTGCAA 
43301 GATGTGCTTT GCTAAACAGA TGCTTGAAGG CAGCAGGCTC GTTAAGAGTC 
43351 ATCACCACTC CCTAATCTCA AGTACCCAGG GACACAAACA CTGCGGAAGG 
43401 CCGCAGGGTC CTCTGCCTAG GAAAACCAGA GACCTTTGTT CACTTGTTTA 
43451 TCTGCTGACC TTCCCTCCAC TATTCTCCTG TGACCCTGCC AAATCCCCCT 
43501 CTGCGAGAAA CACCCAAGAA TGATCAATTA AAAAAAAAAA AAAAAAAACA
43551 ACCCAAGACT GCATAAATGT CCATTCTGAA AACTTGGAAG AAGTACCACC 
43601 TTGATGAATA AGCTGTCTAG CTTTTATTGG CATTTAAGTA TTCTGCCATA 
43651 GGGAAGTGTA AAAGTTGTAG GCTTTTACTT TTTATAGGTA CTATATTGTC 
43701 CAAATAATCT CAGCACCTCA TGGTTGCTAA GGATCTGTGT CCTTGTTTGG 
43751 TCAGATTATG TTTATCTCTG GCATAAGGCA CTTAACAATA TTCATTAAAG 
43801 GTTACAGAAT (.1111 1 GOT CATCTGCTTA GCATTTCATA CCAGTTTGTT 
43851 TTCCACCAAA CTTTCAAATT TTGATTGTTT CATTAATATT CTGCATACTG 
43901 ATGTAAACCA AGTTCTATTA TTGTGCAATC TGCTCCTGAA ACCCTTAGGA 
43951 ACTCTCTGAA GGAGTTTTAT TTATTTTTTG TTTTTGTTTT TGTTTTTGTT
44001 TTGTTTTTTT GAGACGGAGT CTTGCTCTGT TGCCCAGGCT AGAGTGCAGT
44051 GGTGCGATCT CGGCTCTCTG CAAACTCGGC CTCCGGGGTT CACGCCATTC 
44101 TCCTGCCTCA GCCACCGGAG TAGCTGGGAC TACAGGCACC CACCACTGCG 
44151 CCTGGCTAAT TTTTTTTGTA TTTTTAGTAG AGACGGGGTT TCACCGTGTT
44201 AGCCAGGATG GTCTCGATCT CCTGACCTTG TAATCCGCCC GCCTCGCCTC 
44251 CCAAAGTGCT GGGATTACAG GCGTGAGCCA CTGTGCCCGG CTTTTTTTTT
44301 TTTTTTTTCT TTATGGGCTT GTCTTCTACA CTTCAGATTT GACTAAATTA
44351 AATATGCATT AAATGAAGTC AGGAGTTCAC ATTGCCACTA GTAACAATGC 
44401 CTAAGCTTAC ATAAAGCATT ATAAAATTGT TGGTGATTAG TGCCTTCTCA 
44451 GCTATGAGTA TAAGATAATA TTATACTAGT AGTTCAGTTG CCTAGATAAA 
44501 TTGTACACTA TGTGAAGTTT TATTTACATA ATTCTTACGG TATTTTTTAA
44551 GGTAGTTGAT AACAGTTGAG ACTACAATTG TATCTCCATT TTATTGATAG 
44601 TAAAATGAAG GAAGGGAGGG TTACTACCAT AGGAGAGCTC CTCCCCGTTG 
44651 CACTCTTGCC TGTAAAAATT TTTCTGCCAA AACAATTTAG ATAATAGAAT 
44701 TGTAAAAATA TTATTATAGA ATTGTTTCTC TCAAACTATA GTAATGTAGA 
44751 ATAGGTTGAA GGGGTGATGA TTTGAAACAA TACCTCTCCA TTAGCTAAAT 
44801 TTTATATAGA ATCTATTGCA TGTTTTAAAT GATAAGTCAG ATTTATAAAA
44851 ATA 1 1 I NAT AAACAGTAGG AAATGAGTTT AGGGGTATTC ACATACAGTT 
44901 TTAATTTTTA TTTACATATT TAAAACATAT CATGGTATAA ATATGATGTG
44951 GATATAAATT TGAGATAAAG GAAGTATTGT TTAAGAATTG ATGAACTAAT 
45001 TTCTTAAAAG ATGTCATCAC CAGTTGGTTT TCTAGCCTTA TGAAAAATGG 
45051 TTGCAATAAA AAAGATTGAC TATGATAAAA TGCTGCCCTT TCATTTTAAC
45101 CTAGACCAAG AGAAAACATA CTGTGAATCT ATGATGAATG AAAGAAAGTT 
45151 GTAACTGTTG GIN IGTATA TTTGTAATTA CTGTTTATTT TCATTTCTTG 
45201 TGAACTGATA CTGTACTTTG TTCATTGTGA GTAGACAACT TATAATCTAT 
45251 GTACTCAAAT TGGTTTAGTA TAAATTCTAG GGAATGAAGT TCATATTAAC 
45301 TGTAAAATAA CATGATTGTT CTCTAAAACA AAACGTCTTC TGGGATTATT 
45351 TTTAACTAAG GCGCATGGGG ATTTTTTTTT CATTTTTACA GGGAATTGAC
45401 ATAGGGGATA GCTTAACTAT TCCAGATGAA TTCACGGAAG ACGAAAAAAA 
45451 ATCCGGACAA TGGTGGAGGC AGCTTTTGGC AGGAGGCATT GCTGGTGCTG 
45501 TCTCTCGAAC AAGCACTGCC CCTTTGGACC GTCTGAAAAT CATGATGCAG 
45551 GTGAGCTTTA TTATCGTGTG TCCAGGTTTG CCCTAAATAT TCTAAAACAA 
45601 TGAGAAATGT GGTGCTTTGA AAAAGAAGTT TTAAAATTTC TCAGTAATAA 
45651 TCTTTTATAC CCTAAAAAAT AAATCTATTT TGTTGCTGTT AACTCTAAAT
45701 TCAGTCCATG TAAGTATGGC AGTGTACCAA ACCTTAAATT GTTAGTACAT 
45751 GTGTGTAATG AACTTTTAAT CTTTGGCATT CTATGACTAT TCAAACATTT
45801 AATTCAAAAA ATATCTCTAG CTATTGTTGT AGGATTCTCC TGATTTATAG 
45851 TTTCCTTCTT TTTAATATAC TTTATCAAAA GTAAAGTATT TTTGAAATCT 
45901 AGACTCTTAG AGCAGCAATG TAATTTTGAA AATTATTCTA AAGCTGAGGT 
45951 TAGCAGAAAA AGATCTGGCT TTATAGACTG ACTTTGCTAT TTACTAGCAG 
46001 TGTAGCATTG GGCTGGCCAG AGTGGAAAGA GGGAATGGAA AAGAATTAAT 
46051 ATGTATTTGC TCACTGTGGT AACCCAGTTA ATCCTTGCAG CAGCCCAGTG 
46101 AAGTAGGTAT TTTATCATTT TTCCAGGGGG AATCTGAGGC CCAGAGAATT 
46151 GACTTTTCCT TTACAACAAA TGAGAGGGGG AATGCAGTAT CTTTGCCTCC 
46201 AGTGCTCCTG GTTCTCATGC TGCATGAAAC CTCTGAGGTC TCATTTTCCT 
46251 TCATTCTGGG ATGGGGATAA GAATATCTAA TAAGAATGGT TTAAGAATCA 
46301 AGCAATATCA GGTATGTGAT AATGTCTGGT ACACTGGAAT AACCTATTGG 
46351 AACATAGTAG TTGTTTACAA AATATTTTTA AAACTTTGTT ATACTTATGG
46401 TCAACACTTT TTATATTTGT CTGTAGATTT CTGTACAAAA AGATTCTGAC
46451 ACTGTTTTAA GCCAGCATTC CTTCAGAATG TACCCAAATC TCAAAATTTA 
46501 TTTAGGGGCA AAGCTAATGC TTTAAAGAAA AAGGAGAGGG GATTGGTGTG
46551 TGTTTTTTTT TAGGAACAGT AGTAACTTGA CTTTTAGAGA ACTTGAATAA
46601 GCATTTATTT TTTCCTTTGT CCTATTTTAT TGTGAAGTTT ATTTATTTAA 
46651 AATAAAATGG ATTTCTCTGG AATTTAGTTT CTGCAAATTT GAGGAGTTTC
46701 CAAAGTCAAC CTTCAGGTTT GATACTTCTC TAGAAAGACT CACATAACTC 
46751 ACTGAAAGCT TATTACCCCT GGTTATGGTT TATTACGGGG AAAAGATGCG 
46801 GATGAAAATC AGTCAAGTAA AGAAGCACAT AGGGCAGAGC TTCTGTTGTC 
46851 CTCTCCCTGT GGAGTCTCCA TGTCTTACTT TCCTGGCACT GTTATGTGGC 
46901 ACTAGGCATG GAATATTGCA GACCAACCAG GGAAGCTCAC CTGAGCCTTT 
46951 GGTGTGCAGA GTTCTTATTG GGGCCTGTTT TCATACTGGC CACATGGCTG 
47001 GCCTTCAGAA TTCAACCCGT TCTGTGAGTG TGTGTGTGTG TGTGTGTGTG 
47051 TGTGTGTGTG TGTTTAGTGG TAGTCACCCC TTTTATGTGA GCTGAAACAA 
47101 TCAGAAGAAT AGCTGATTTG TTTAATTATT TTTGGTGTAT TGGACTTAAT 
47151 CAGTTTTTAT CTGTAGGTGG TCATAAGGTA CAGTATTTTT AAGTGACTAC
47201 CACATCTGTA GTATAAGCCA AGTAATTTAT CAGTACTCAC AGGATGGGTA 
47251 CATGTTGTAA TGAATTTATT GCCTAGAGAG GGCCTCAAAA TATGCCAAAG 
47301 AGGGTGCAAT TTTTATTTTT GGTTTCAGGC TGTATGCATT CCAGTGTTGG 
47351 TAGCCCTGAT ATACACAATA TCCAAACCAT TTCAGACCCA TTTACAGTTC 
47401 ATGTCTGTAC TACTTCTTGA GGAGAGGGAG TAACATATTA CTTTAAATTA 
47451 TATGTAATAA TATACATACA TTAAATTATA TGTAATAATA TAATATTATT 
47501 ATTTGCAGTA TATTTTTTTA TTTCCCTTTA ACTGAGCTTG TTCATGTTTC
47551 AAAGGGTGTT CCATTGCCTG ATACATAATT TAGTTAATAT TATCTTATGA 
47601 AGGTTGTTCA TAATTTTAAT ACTCTTCTTG TCTTCTCTCT CTGCTTTCTC 
47651 ACACTGAAGA TACCAATTAT TCTTAGTTTT AGAGTCAGAG ACAGGCCTCT 
47701 AAAATCATGG CAATACTCCC TCTCATCATT ATATATATTT TTCAACCTTT 
47751 CTATATTTTA TTTTCAAATA TATCTTCTTG CAGTTAGAAA CGGTATTGAA 
47801 AAAGATTGTG TGGTTGTTCT AGAAAAAGTA ATAGTAATAT GCCACCAGCA 
47851 TTTTATATCA TTCTGCTTTT ATTTTTAGGT TCACGGTTCA AAATCAGACA
47901 AAATGAACAT ATTTGGTGGC TTTCGACAGA TGGTAAAAGA AGGAGGTATC 
47951 CGCTCGCTTT GGAGGGGAAA TGGTACAAAC GTCATCAAAA TTGCTCCTGA 
48001 GACAGCTGTT AAATTCTGGG CATATGAACA GGTAATTGTT ATCACCCGTG 
48051 GAATTTATTA ACAAAGAGGA GTTAGTAAAC GGATTCAATA AATGTTAATG 
48101 TATAATGCTT TTGGGATTCT TGTTTTAATA CATGATAATC TTTCACATAT 
48151 ACCCCATAAG GAGGATCACT TATAGGAGAT TAGACTAAAT AAAATCAGAG 
48201 ATTTCTCATG ACCAAGTTAT GGGATTCTTA ATTCATCATA TTATTTATAA 
48251 AGTTTTTTTT TTCTAAGTAG TTCTTAAAGG AAGGGTAGAA TTTTAGTTTA
48301 TTCATTCTGA ATCCTGAGCA GAAGCAGCAC ACTAACATAA GTTTTATGAA 
48351 AGTGTCACAA TCTAACCTCT GGAAGGAAAA CTATAAGTTG AAGTCCTTTG 
48401 TGTAATTTGA CGTTGCTGTA AAATTGAGCT GAGTTTGGAG TGACACCTCC 
48451 ATGAAGGCAG GGGCGTGGCT TCTTCCCCAT GTACTCCAGC ACCTAGACAG 
48501 AGCTTGGCAT GTGATAAGTT TCAAGCGAGT GTTGAATGAG TCAATGAATG 
48551 AACAAATGCA TTTACCTCTG AATCACTTCT CTGTCGGCTT TTGTTAACTT 
48601 GGATTATTTG AGCTATTGCT TCAGCCTAAC TCAATGTAAA GGGGAAATAC 
48651 AGAGGTAAGT TTTAGAGTTT GGGTTCTCTT TATGGTCATT AGCAGAACTG 
48701 TCTAGTTGAG CAGCCACAGA TTATGTTTTC CATTATTTAT TCCATCATTG 
48751 TTTATCAAGG ACTGTAAGGG CCTTGAAATT CAACTCCCCC CCCCATAGTT 
48801 TTTGTATTAT TCCATGTAGA TTTTAGATTA TTCTGGAGAG TGTTTTGTTC
48851 TTGAGCAACA GAATACTCTT GAGAAGATTA CGAAGTCCAG TGGTATCCTT 
48901 TTCTTTGCCT AGGAAATAGA GAAGCAAAAA AAAAAAAAAA AAAAAATTAA 
48951 AGAAAATCTA GTCTCCAGGA TTTTAATTAG AACCTATCCT TGGGAAGGCT 
49001 ATTTTCCTTA TATGAAGGTT TGAAGATTCA AATCATGATT ATTAAGGGCT 
49051 AATGTTTGAG ATACCCTTAG GTTATTCTGA CCACATACTT GGATTTTATG 
49101 ATAGGAAAGC CACAGCCTAA AATAAATAAA TACTCAATGC AGTTATTTCA 
49151 GTATGCAAGA AGTTTGGTAT TTTTGAAAAA GTCCATGGGT ATTGCAAGCA 
49201 AATATGCACA TTTTGCTTTA TGCCATTTGT CAGATTCTTA CCTTGGATAC 
49251 CACCAACAGG CATCCTCTGC TTCTGTCCAC CCAAGCTCCT TCCTGAGACC 
49301 TCTTTATAGT ATTGTGATTT CTGCACACTA ATTTTCTTAG ACATGAAGAG
49351 AAAGCTGTCT ACACAGTGTG GTGTAGTTTT CTTATGGGCT CTGGACCTAT 
49401 GGTGCTGTTT TCTCTCCTCC TGCTGAAGGT CCATTCATCC CTCGGGGCTC 
49451 TCTAAAAGCC ACCTTCCTGT GACAAGCATA TACTAAGCAT CTCAATCAAA 
49501 GCCAGTTCCT CCCCTGTCCA GCCTCCCTGG AGTGCTGAAT TGCAGAATAT 
49551 CCCATTTTTC ATTGGATGAT GGAAAACCCA TTGTTTTCCC AGTGGATTGT
49601 AAATTACTTC GGGGTAAATA GGCTGTATAT ATTCTCAAAT TTCCCAGAGT 
49651 ATGTAACTAG GTCACTTTTA GATTCAGATA GATTTTGTTC CTTGAATAGC 
49701 TAGTACTTTA GGAAACTAAG AAAAAGATCT TTTCAACCTG GTATGTAGCT 
49751 CTGTCAAACA CATCATCAGT ATGGGGTAAA CCTGTGTTCT CTGTGGGTTG 
49801 TCATTACCAT AGTAGTGTCA TTGTATCATT GACAGTGTAA TAGTGTGGGG 
49851 TAGTGTTCTT GTGGTTTCAG CTGCCACTCT GTACTGACTG CTTTCCACTC 
49901 CAACATCTTC CTCTTTATCT CAACACTGTA GGTCTACCTG TGTACTGTGT 
49951 GTTTCAGCAT CTCTGCTPGC ATGACCCAGG AGTGCCTCCC ACTCAATATG 
50001 GCCACCATGC ATGGTCATCT TTCTGCTACT CCCTGTCTCC TGACCCTGCT 
50051 CCAGCAACAC AGACAGACAC CCTTCCTCTT TCTATATGTC ATATGGTGGG 
50101 GAATGCCCTT TAGTACTTAC TCAGGAGTTA GTTCCTCTGG GAAGCCTTCT 
50151 GTTCTAGTTT CCTTTTGTTA CAGCACTTTC ACATTGAATT CTGACGTTCT 
50201 CTGTACTTAT CTGCTTTGTG AGACTGTGAG CTTCCTTAGG CAGTAGCTAC 
50251 TTGTATTCTT AGCACCTTGC CCAGTCCCAG GAAACCCTTA TTAAGTAAAT 
50301 GAAAAGACAG AACTGACAGA CTGGAATTAG AGCTCAAGCT TGCCTCAATC 
50351 TCAAGCCATT AAGATGAAGG GGAGCCGGGC GTGGTGGCTC ACGCCTCTAA 
50401 TCCCAGCACT TTAGGAGGTA GTTTGCTTGA GCCCAGGAGT TCAAGACCAG 
50451 CCTGGGCAAC GTGGCAAAAC CCCATTTCTA CAAAAAATAT AAAAATTAGT 
50501 TGGACGTGGG GGTGTGTGCC TGTACTCAGG ATGCTGAGGT GGGAGGATCA 
50551 CTTGAGCTCG AGAGGCAGAG GTTGCAGTGA GCTGGGATCA CACCATTGCA 
50601 ATCTAGCCTG GGTGATAGAA TGAGACCTTG TCTCAAAAAA AAAATAAATA 
50651 AATAAATAAA GGGGAAGATA AGGATTGGAA ACAGAAGGAG CAGCATGTGG 
50701 ACAGAAATGT AGGCACAAGA AGGCATCACT CACTGAAGAG ACTGAAAGTG 
50751 GTTCACTGTG CCTCAAGACT GGTGGAGTGT GTTTCCGGAA AGATAATGAT 
50801 GAAAGAGCTG GACAGATAAA CAGGGGCCAA ATGTAATAGG AGTCTGGATT 
50851 TTATTCTGAA TATGGTAGGG GCTATTGTAG CATCTTATAT AGGGAAGTGA 
50901 AATGAGTACA TTCACATTTA AGGAATATCA ACCTGAAAAA AGAGTGGAGA 
50951 CATTGTTGGG GGAGAGTGAG GTAGACTAGA GGCAGGGAGA ATATTTAAAT 
51001 AATTGAGGTA AGAAATGATG AACACCAGTA TAAGGTGATG TCTTTAAGGA 
51051 ATGGAGAAGG GAATGAACTG AGAAATATTT TGGAAGTAGA ATCAACAGAA 
51101 CTCACTGACT GACTGGATAT GGAGGTGAGA AAGAGAAGAG TCAAGAATGA 
51151 TATTCTAATT TCTAACTTGA GTGACTGCAT TCAAAGAGAA TACAATATCA 
51201 GGTTCCATTT TGTGCATGCT GAGTTTGAGA TGTGTGGGAC ATGTACAGGG 
51251 AGCTGTCCAG TAAGCAATTG GGTATATCAG CTAGCCATTA AGAGAGAGAT 
51301 CTTTGATAGA GAGGTTGTTG CTGAGTTGAG CCATTGGAAT GGGCAGGATC 
51351 ACTCAAGAAG AGCTTATAAA TGAGAAGAAT TCTAGGAATA AGTCCAAAGG 
51401 GAGAAGTAAA AGAAGAAACT TGCAAAGGAC ACTGAGAAGA AATAGCTCGA 
51451 GGGATGGGAG AAAATCCAGA GAGAGGGATG GCATAGGAGT CAGTGGAAGG 
51501 AAACGGTTTC ATGGGGGTCA GTACTACTGG GTAGTGAATA TAATAAGAAT
51551 ATCTTTTAGG ATTTCTCAAC CCAGAGATAG GTAAGCTTAG TATAAATGCT 
51601 TCTGTGAAGT AATGAAATGA GAAACCATGC TGAAATGAGC TTAAAGTGAA 
51651 TGGGAGGTGA AGAAACTTGG ACAGTAGAGA CACATTTTTA GGGAGTTTGA
51701 CAGTGAAGAG AAGGAAACTA GAAGAGGGAG AGGGTGATAG ATAAGAAAGA 
51751 TGTTGGGTGG AGGGGATTTG TTTTTTTGTT TTTTTGTTTT TTTTCTGTTT
51801 GTATGTTTGT TTGTTTTTGA GATGGAGTCT CACTTTATCA CCCAGGCTGG 
51851 AGTAAAGTGG TGCAATCTCA TCTCACTGCA ACCTCTGCCT CCTAGGTTCA 
51901 AGTGATTCTT CTGCCTCAAC CTCCTGAGTA GTTMNNNNNN NNNNNNNNNN 
51951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
52001 NNNNNNNNNN NNNNNNNNNN NNNNNTGCCT CAGCCTCCCG AAATGCTGGG 
52051 ATTGCAGGAG TGAGCCCCCC GTGCCTGGCC TGGAGGGAGG ATTTTGATTT 
52101 GACTTTAATG TGCCTGTTGC TGAAGGAAGC ATGTCAATAC AAATAAAGAA 
52151 GTTGAAAACA TAGGTAAGAG AGGTTGATTA ACCCGGTAGG TGTTTCAAGG 
52201 GAGTTTGTGT GTAGGGAAAG GGAGTGGGAG ATGGAAAGGG GCTGGGGGAG
52251 ACAGGTTCTA TCCAGAGACT GTTAAAAGGA TTAGTCTTTG ATTACAAGAA 
52301 GAACTCTTCT TATACGTGTT TGGGAAGAAA AAATATGTGA GTAGCTATGG 
52351 ATAATTTTGC AGGAGGTGGG CAGAATACCA AGATATTCTG CCTGGTGGCC 
52401 TCTCTACTCT TCCTTGAGCT CCTGAGAAAG GATGTGATCT GAGAATGAGG 
52451 GAGGAAGTGG TATTGGAAGC TGGAGGAGAA TGGAGAAGAT CAAAATGGTT 
52501 AGTCTAACAA ATGGGAGAGA ACTGAGATAG ACAAAAGGAT TTCAGGGTGG 
52551 TTTTGAGGGC TCAGTTAAGT CTCCTTTAGG AAGGTTCAGT TCTGTAGCCT 
52601 TGGCAAGTTA CTTAAAGTCT CTGTGACTAT TACCTCATCT CTAAGATGGG 
52651 GACTAAGCTT GGTGACATAG TTTTACATAC CAGGCACAGT GCCTGACTTT 
52701 TTGGCTCTGT CCTGAAGTCT TCCCTTTGTA TATGGTATGT TTCGGGGAAT 
52751 AGGAGCCTCA AGCACTTATC CTTTAAATAT TTATCCTCCA TCAGTCACTA 
52801 AACGTTTACT CTGTACTTTT GATAGGTGCT GTGGGGGTCC AGGGTATAAA 
52851 AGGTACCTTC AAAGTTACTG TTAAAGTGCA GGAAGGTTTT TAAGCAAATT 
52901 ATGTTTAATG ATTTTGACAA TCTGACATGC AGGAAAATTA ATAGGGCCTA 
52951 TGCAGAAGAG GAGTTTTATG TAACACTCTG TAGTTCAGGA AACAGAGCCC
53001 TTGGAAGCAG TGATCTCTCT GGGGAGGAAT GTCTGGTATT TGGGAATCTC 
53051 ATGAAATGAT AATATACTTA ATTTTTATCA TGAGCAGCAA AACACAGATT
53101 TGCTAGGAGA AAGTCATCGT ATGTTGTTGC ATTGGGCACT TTAGATCCCA 
53151 GGGAACAGAA ACTGGCTGGC ACAGGAATGG GCATCACTGT GGGGATGGAT 
53201 CATGTAGGGG AAGGATCCCT GGAGAAGTCC AGGAGGTGAG ACTTCCCCCT 
53251 TCCCTTCTCC ATGCATGAGT CCACTTCTCT CTGTTGACTT TCCCCTTGTC 
53301 CCTCTGGTGA CAGCAGCTGC TTACCTCTGG AGACCCCCTC ACATTTCTGA 
53351 GAGAAGGAAT CTGGCTTGCC TGGCTAATTC CCATGGTCTA TGTTTGGGCA 
53401 GAATGTCTTA GCAAGTTGTG TAAAGATAGT GTATTCATAT ATTAATAATA 
53451 ATAATAACAT CTACTGAACA TTTGCTAGGT GTTCAGACCT GCACTAACCG 
53501 TGTTACAAGT ATTATTTTTT TGTAATCCTT TCCATAACCC TGTGAGGTAA
53551 GTACTGTTAT CACAGACAAG GAAACCACAA TGTGGACCTG TTCATGAACT 
53601 TGCTCGAGGC CACGTGGCTC TGGAGTTCCA GCTCAGGTCT GCCTGACTCT 
53651 CAATCCCATG ATATTAATAT ACTGGCCAGT CACTATTTTG GCTGTATTGG 
53701 GGTCATATTT ATACCCTTGG TCCAGTTAGC TATGTTGGGT CACTTTAGTA 
53751 CTGATAGCCA GGGAGATGCT GGGCTTGATA GGTTAGTATA ATTCTATGTA 
53801 TTACCTACAA AAACTGTTTT TATAAATTGT TTTGTTAACA TTTGTTTGTC
53851 ACCTATTTAT TCATTTTATT TGCACTGGTG AAAATAAACT CATCTTTTAA 
53901 AAACTGTGGG GAAAATATCC AAACATTGTG AAAACTTGAT TAACCTTGTA 
53951 TTTTCTGTAC ACCTGGGGAG GGATGCTGTT ATGCTGTTTC AGCAAAGGAG 
54001 CAACTTGGTC CAATCTGGGA GACATCTGTG TTTTGTGGAA ATCTGACTTG 
54051 AAAACCACTG TCCAGTCACT GCGTGTATTA GCATTTAGGC CTTGCTCTTC 
54101 TGCTATGTAT TATTAATGTA GTGTATACAT TTCGAGACAC ATCATCACAT 
54151 TTGTCAATTT ATTGATTTCT AGGAGCTGAT TTGTATTCTA GGATTGTCTA 
54201 GTTGGCTTGG GCTGCCATAA AATACCACAG TGTGTGTGGA ATCAACAACG 
54251 GAAATTTATT TCTAACAGTT TCAGAGGCGG GAAAGCCTAA GATCAAGGGC
54301 CAAGCCAGTT TGATTTCTAG TGAGCGTTCT CTTCTCAGCT TGTAGACAGC 
54351 TGGTATGTGC TCACATGGTC TTTTCTTGGT GCACATGTGA AGGGGGAGAG 
54401 AGAGAGTGGG CTCTCTGGTG TCTGCTCTTA CAAGAACACT GATCCTGTCA 
54451 TGAGGGCTCC ATCCTCATGA CCTCATAACC CTAATTACCT CCAGAAGCCT 
54501 CATCTCCTAA TACCATCACA TGGGAGGTTA CAGCTTCAAC ATATGAATTT 
54551 GGTGGGGGTG CAGCTCAGTC CACAGCAGGT AGTAATGTGC ATTTTAAAAC 
54601 TTGTTTATAC AGTACAAGAA GTTACTTACT GAAGAAGGAC AAAAAATAGG 
54651 AACATTTGAG AGATTTATTT CTGGTTCCAT GGCTGGAGCA ACTGCACAGA 
54701 CTTTTATATA TCGAATGGAG GTGAGTACCA TTGTCAAGTC TGACTGTGTG 
54751 ATGGTGTTCG TGTTGGTTGT CTATTGCTCT CTAACAAGTT ATCCCAAAAT 
54801 TAACAGTTTA AAACAAGCAT TTATCATCGC ACAGTTTCTC TGGGTCAGGA 
54851 ATCTGGAAGC AGCTTAGCTG GGTGCCTCTG GCTCAGGGTT TTTCACAGCC 
54901 CACAGTCAAG ATGGTAGTCA GAGCTTGGAA TCAGCTGGAG GCGGATTCCA 
54951 AGCTCACTCA TGTTGCTGCC AGGCCTCACT GGCTATTGGC TGGAAACATC 
55001 AGTTCCTTAT CACGTGAGCC TTTCTGTAGG CTGCCTGAGT ATCCTCAAAA 
55051 CACAGTAGCT GGCTTCCCTA GAGTCAGTGG TCCAACAGAG AGAGAGAGAG 
55101 AGAGTGCCTA AGATGAAAGC TGGTATCTTT TGCCTCTTCT GCTCTATTCC
55151 ATTGATCACA CAGACCAACC CTGGTAGAGT GTAGGAGGGG CTGGTATAAT
55201 GGTGTTAATA ACCGGAGACA AATATCACTG GGGGTCACTT TAGAGGCTGG
55251 CTGCCACTTT AGAGGCTGGC TGCCATTCCT GTCCAAAGAG TTTCTGTACC
55301 ATAAATTTAA TAATGGAATC TCAGGATTTG ATTATATGGT GATTATCCTA
55351 ATTAGACATC CTTTCATTAG TGCATAGGTT GGCAAAACAC AGACCTACGG
55401 ACTGTTTCAT ACAGCCCTTG ACCTAAGAAT GCCTTTTACA TTTTTAAAAA
55451 GTGGGCAACA CAGGAAAAAG TGAGAAAGAT CTAAAATCGA CACCCTAAGA
55501 TCACAATTAA AAGAACTAGA GAAGCAAGAG CAAACAAATT CAAAAGATAG
55551 CGGAAGACAA GAAGTAGCTA AGGTCAGAGC AGAACTGAAG GAGATAGAGA
55601 CACGAAAAAC CCTTCCAAAA ATCATTGAAT CCAGGAGCTG TTTTTATGAA 
55651 AAGTTTAACA AAATAGACAA CTAGCCAGAA TAATAAAGAA GAAACCAGAG 
55701 GAGAATCAAA TAGCCCCAAT AAAAAATGAT AAAGGGGATA TCACCACCAA 
55751 TCCCACAGAA ATACAAACTA CCATCAGGGA ATACTATAAA CACCTCTATG 
55801 CAAATAAACT AGAAAATCTA GAAGAAATGG ATAAATTCCT GGACACATAC 
55851 AGGCTCCCAA GACTAAATCA GGAAGAAGCT GAATCCCTGT ATAGACCAAT 
55901 AACATGTTCT GAAATTGAGG CAGTAATTAA TAGCCTACCA ACCAAAAAAA 
55951 ACCCAGGACC AGACAGATTC ATAGCCGAAT TCTACCAGAG GTACAAAGAG 
56001 GAGCTGATGC CATTCCTTCT GAAATTATTC AAACAATAGA AAAAGAGAGA 
56051 TTCCTCCCTA ACTCATTTTA TGAGGGCAGC ATCATTCTGA TACTAAAACC 
56101 TGGCAGAGAC ACAACCAAAA TAGAAAATTT CAGGCCAATA TCCCTGATGA 
56151 ACATCAATGT GAAAATCCTC AATAAAATAC TGGCAAACTG AATGCAGCAG 
56201 GACATCCAAA AGTTTATCCA CCATGATCAA GTTGGCTTCA TCCCTGGGAT 
56251 GCAAGGCTGT TCAACATATG CAAATCAATA TAACGGAATT CATCAATAAA 
56301 CAGAACCAGT GACAAAAACC GCATGATTAT CTCAATAGAT GCAGAAAAGG 
56351 CCTTCGATAA AATTCAACAC CACTTCATGT TAAAAACTCT CACTAAACTA 
56401 GTTATTGATG GAATGTATAA CAAAATAATA AGAGCTGTTT ATGACAAACC 
56451 CACAGCCAAT ATCATACTGA ATGGGCAAAA GCTGGAAGCA TTCCCTTTGA 
56501 AAACCGGCAC AAGACAAGGA TGTCCTCTGT CAGCACTCCT ATTCAACGTA 
56551 GTATTGGAAG TTCTGGCCAA GGCAATCAGG CAGGAGAAAG AAATAAAGCG 
56601 TATTCAGATA GGAAAAGAGG AAGTCAAATT GTCTCTGTTT GCAGTTGACA 
56651 TGATTGTATA TTTAGAAAAC CTCCTTGTCT CAGCCCCAAA TCTCCTTAAG 
56701 CTGATAAGCA ACTTAAAGCA AAGTCTCAGG GTACAAAATC AATGTGCAAA 
56751 AATCACTAGC ATTCCTATTA ACCAATAATA CACAAACAGA GAGCCAAATC 
56801 ACGAGTGAAC TCCCATCCAC AATTGCTACA AAGAGAATAA AATACCTCGG 
56851 AATACAACTT ACAAGGGATG TGAAGGACCT GTTCAAGGAG AACTACAAAC 
56901 CACTCCTCAA GGAAATAAGA GAGGACACAA ACAAATGGAA AAACATTTCA 
56951 TGCTCATGGA TAGGAAGAAT CAATATCATA TCATAGGAAG AATCAGTGGC 
57001 CATACTGCCC AAAGTAATTT ATAGATTCAA TGATATCCCC ATCAAGCTAA 
57051 CATTGAATTT CTTCACAGAA ATAGAAAAAA CTACCTTAAA TTTCATATGA 
57101 AACTAAAAAA GAGCCTGTAT AGCCAAGACA ATCCTAAGCA AAATGAACGA 
57151 AGCTGGAGGC ATCACGCTAC CTGACTTCAA ACATACTACA AGGCTACAGT 
57201 AACCAAAACA GCATGGTACT GGTACCAAAC AGATATATAG ACCAATGGAA 
57251 CAGAACAGAG GCCTCAGAAA TAACACCACA CGTCTACAAC CATCTGATCT 
57301 TTGACAAAAA CAAGCAATGG GGAAAGGATT CCTTATTTAA TGTATGGTGT 
57351 TGGGAAAACT GGCTAGCCAT ATGCAGAAAA CTGAAACTGG ACCCCTTCCT 
57401 TACACCTTAT AAAAAAAAAA TTAACTCAAG ATAGATTAAA GTCTTAAACA 
57451 TAGACTTAAA CTATAAAATC CCTAGAAAAA AACCGAGGCA ATACCATTCA 
57501 GGACACAGGC ATGGACAAAG ACTTCATGAC TGAATCACAA AAGCAATGGC 
57551 AACAAAAGCC AAAATTGACA AATGGGATCT AATTAAACTA AAGATCTTCT 
57601 GCACAGCAAA AGAAACTATC ATCAGAGTGA ACCGGCAACC TACAGAATGG 
57651 GAGAAAAATT TTGCAATCTA TCCATCTGAC AAAGGGCTAA TATCCAGAAT 
57701 CTATAAGGAA CTTAAGCAAA TTTACAAGAA AAAAAAACCC ACCAAAAAGT 
57751 GGGTGACGGA TATGAACAGA CACTTCTCAT AAGAAGACAT TTATGCAGCC 
57801 AACAAACGTG AGAAAAGGCT CATCATCCCT GGTTGTTAGA GAAATGCAAA 
57851 TCAAAACCCC AATGGCATAC CATCTCACGC CAGTTAGTTA AAAAGTCAGG 
57901 AAACAACAGA TGCTGGCAAA TATGTGGAGA AATAGGAATG CTTTTACACT 
57951 GTTGGTGGGA GTGTAAATTA GTTCAAGCAT TGTGGAAGAC AGTGTGGCAA 
58001 TTCCTCAAGG ATCTAGAACC AGAAATACCG TTTGACCCAG CAATCCCATT
58051 GCTGGTTATA TACTCAAAGG ATTATAGATT TTTCTACTAT AAAGACACAT 
58101 GCACACGTAT ATTTATTGCA GCACTGTTCA CAATAGCAAA GACTTGGAAC 
58151 CAACCCAAAT GCCCATCAGT GATAGACTAG ATAAACAAAA TATGGCACAT 
58201 ATACACCATG GAATACTATG CAGCCATAAA CAAGGATGAG TTCATGTCCT 
58251 TTGTAGGGAC ATGGATGAAG CTGGAAGCCA TCATTCTCAG CAACCTAACA 
58301 CAGGAACAGA AAACCAAACA CCACATGTTC TCACTCATAA GTTGGAGTTG 
58351 AACAATGAGA ATACATGGAC ACAGGGAGGG GAACATCACA CACTGGGGCC 
58401 TTTTTGGGGA TGAGGGGCTA GGGGAGGAAT AGCATTAGAA GAAATACCTA
58451 ATGTAGGTGA CAGGTTGATG GGTGCAGCAA ACCACCATGG CACGTGTATA 
58501 CCTATGTAAC AAACCTGCAC GTTCTGCACA TGTATCCCAG AACTTAAAGT 
58551 ACAATTTTTA AAAAGTAGGC AAAAACAAAA GAAAAGAAAA GTAATATACA
58601 ACCGAGACCT AATATTTTAG GCTTGCAACG ACAGATATTT TACTATTTAG 
58651 TCTTTACAGG AAAAGTTTTC CAACTACTGC TTTATAGCAA AAATAATATT 
58701 GTAGATGTGG AATTTATTGA TATAGCAGAG GGGTTTTTAG TAACTGATGA 
58751 CTTAAGCAAG ATAAATACAA TTTTCACCGA TATGTGGTAT GCATGCTAAT 
58801 ACAGCTTTTT TTAAGCATCT TAATATGATT GTTTATATTA CTCCACACAC
58851 CTCTCAAAAA AACTTAATAC CCTATTTTTC CTCTCATATC CTCCGATATC
58901 AGTTAATAGT ATCACCTTCC CAACTCCCCA CTGCCCCATC CTGTGTTCCA 
58951 AGCTAGAAGT ATTGGGGTTA TCCTTTATAC TACCATTTCC CTCACCTTCC 
59001 AGATGCAGGT GGTCACCAGT CAGTTTTGTT AAGACATCAA TAGATTATCT 
59051 TGCTTCCATT TCCTTGGTCA CTTCCTTCAT CAGATCCTCC TTGCAGTAAA 
59101 CGGGTCTCTC TGGCTTTGGT CTTAGCCCCC CAATAGAGGT AATACATGAA 
59151 AGAGAATGTA TCAACAAATT GTACAGTCTT TTGAGTCACA ATATGTGCTA 
59201 GGTATTTGTT CCATGTAAAA TTACTTCATT TGAATCCCAT GATGATAGAG 
59251 TTAATATGAA CAATCATATT TTGTTTTTTT TTATATCCAG GTTATGAAAA 
59301 CCAGGCTGGC TGTAGGCAAA ACTGGGCAGT ACTCTGGAAT ATATGATTGT 
59351 GCCAAGAAGA TTTTGAAACA TGAAGGCTTG GGAGCTTTTT ACAAAGGCTA
59401 TGTTCCCAAT TTATTAGGTA TCATACCTTA TGCAGGCATA GATCTTGCTG 
59451 TGTATGAGGT GAGTTTGTAG AAATCTTTTG AATTGGAAAA TGCAGTTAGA 
59501 TCTTGTTAGA ATTGGACTTT ATATGAAGAA GTAGATATAT ACCAGAAAAC 
59551 AGTGTGTGAC CAGAAGTAAA TTCAAGCATG TGTTATTTGA ACTTTCAAGT 
59601 AACTTGAGTG TGAATATGCA TGGGGTCACT TTTGTATTAG ATTTTCTTGG 
59651 GAATTGCTTT TGTTAATGAA GAGTAGACTC AAAGTTAGGT ATAGTTGTTC 
59701 ACCTTAAAAG GTGTTTCTAG AGATTTTTTC CI MUM IG GATTTGCAAA
59751 AATCTGACAT TAAGCCAAGT GACTAATGTG ACTAACATGA GTAATACAGT 
59801 TTCATTCCTT GTACGGAAGA ATACAAATCT TGGATCAACC CTGCAATCTA 
59851 AATCATTTAA TAATTTATGA ATCTCACAAA CAATTATTGA GCACACACTA 
59901 TACAAACCAC TAGGTTAGAC ACTGGATCTG GGGATTCAAA GGACTCAATG 
59951 TGTGCCTTGA AGAAACTGAA GGTCTGGTGG GGGAGACAAA CGACTAAAAC 
60001 TCAGCGTGGT TATCTGTGCT GCGACAGACA TGAGCCAGGG TGCATGTTAG 
60051 GATGAGACCT AAGCTACAGC GTAGAGGAAG AGTGGAATGT GTAATGAAAA 
60101 GAAGAGTCGA ATTTTTTTTT TAAAGAGCTT TATTGAGATT TAGTTCATAT
60151 TCCTTACATT TCACTCATTT GAAGTGTACA AGCAAATGGT TTTTGGCTTC 
60201 TTACATAATT TTTAAAAATT ATTATAAAAT ATAAAATTTG CCATTTTACT 
60251 AATTTTAAGT GTACAATTCA GTGGCATTAA TTACATTCAC AATATTGTGC 
60301 AACCATCAAC ACTATTTCCA AATCCTTTTC CTCACTCCAA ACAGAAACAC 
60351 CTTAACCTTT AAGCAATAAC TTCCTACCCT CCGTAACTCA AACCTTTGGT 
60401 AACCTCTAAT CTGCTTTCTA TGTCTAGGAA TTTACCCATT CAAGATATCT 
60451 TATAAGTAGA ATCATACAGT ATTTTTCTTT TTGTGTCTGA TTTATTACTC
60501 TTAGCATAAT GTCTCTAAGG TTTGTTCATG TTGTAGCATG TATCAGAACT 
60551 TCATTTCTTT TCATGGCTGA GTAATATTCC GTTATGTGTA TATACCACAT
60601 TTTGTTTACT CCTTCATCTG TTGAAGAGCA TTTGGATTAT TTCTACTTTT
60651 CCAACATTGT GAATAATGCT GCAGTGAACA TTGGCATCTG CGTATCTGTT 
60701 CGAGTCTATG CCTTCAATTC CTTTGGGTAT ATATCTCAGA ATGGAATTGC 
60751 TGAGCCATAT GGTCATTCTG TGTTTAGCTT TTAGGAACTA TGAGACTGTT 
60801 TTCCATAGTG GCTGCACTTA CATTCTCACC AGCAACATAC AAAGGTTCCA 
60851 GTTTTTCCAC GTCCTTATTA ACACTTAATT TCCATTTTAA AAAAGCTTAT 
60901 TTTTATTATG GCCGTCCTCT TAGGTGTGAG GTGGTATGGT TCAGGACTTT
60951 ACTTCTTGTG CTGAGTTTTT TAAAAAATTG TGATTAAAAA CACATAACAT 
61001 AAAGTTTATG ATTTTAACCA TTTTTAAATA TATAGTACAG TAAGTGTTAA
61051 CTGTTTGTGG TTTGTTGTGC AACAGATCTC TAGAACTTTT TCACTTCTCA 
61101 AAACTTAAAC TCTATAGTCA TTAAACAACA GCTCCCAATT TCCCCTTCAC 
61151 CCCAGCGCTG TGTAACCTAC TTTCTCGTTT TATGAGTTTG ACTACATTAA 
61201 ATACCTTGTA TAAGTGAAAT CATGTGGTAT TTCTCTTTCC GTCACTGGCT 
61251 TATTTCATGT AACATAGTTT CCTCATGATT CATCCATATG ATAGCATACA 
61301 ACAGGACTTT TTTGTTTTTA AGGCTGAATA ATAATTTGTT GGGTATATAT
61351 ATCACATTTT CTTTATTCAT CTGTTGATGG ACATTTGGAT TGTTTCTACA 
61401 TCTTGACTAT TGTGAATAGT GCTGCAGTGA ACATGGTTGT GCAAATATCT
61451 CTTCAAGATA CTGTTTTCAG TTCTTTTTGA CATATACTCA GAAGTGGAAT 
61501 TTCTGGGTCA AATGGTAATT CTATTTTTAA GTTTTTGAGG AACCTCCATG
61551 TCATTTTCCA TAGTAACTAG ACCTTTTTGT TTTTTAACAT TTCTATCAAT
61601 GTACACCAAG ATTCCAATTT CTCCATGTCC TCCCCAACAC CATTAAGTGG 
61651 GGTGGTGGTC TACTACTATT GCTGTGTTGC TGTTTATTCC TCCCTTCAGT 
61701 TCTGTAAGTG TTTGCTTCAT ATATTTAGGA GCTTAATATT AGGTCCATAT 
61751 GAAGTTATAA TTTCTTCCTG GTAAAGTGAC CCATTTATCA TTATGTAATG
61801 TCCATCTTTG TCTCTTGTGA CAGTTTGTGT CTTAAAATCT ATTTTGTCTG 
61851 ATGTAATTAT GGCCACCCCT TTTCTCTTTG GGTTCCCGTT TTTATGGAAT 
61901 ATCTTTTTCC ATCCTTTCAC TTTCAGCTTA TGTGTGTCCT TAGATCTAAA 
61951 GTGAGTCTCA TAGATAAGGT ATAGTTGATT CTGTATGTGT TATTCACTCA 
62001 GCAATTTATA TCTTTTAGTT AGGGGATTTA ATCCATTTAC ATTTAAAGCA 
62051 GTTACTGATA GGGAAGGACT TACTGTTGTC ATTTGGCTAG CTACCTTTTT
62101 ATCTTTGTCC TGTGGCTTTT CTGTTTTTCC CTTCCTCTCT TCCTGGCTTC
62151 TTCTGTGTTT TGTTGATTTT TTTTTTTTTT GTAGTGATAT GTTCTGATTC
62201 CCTTCTCATT TCCCTTTGTG TGCATTCTAT AGATGCTATT TTTGTGGTTA 
62251 CCATTGCAAC TACATAAAGC ATACTAAAGT TATAGCAACT TATTTTAAGC 
62301 TGTTTACAAC TTAACTTCAG TGGTATATAA AACTCTATTT CTTTACATAT 
62351 TTCACCTCCT CCCCACAAAC TTTATGTCTT TTGATATTGT ATATCCTTAA 
62401 CATAGATTTA TAGTTACTTT TTATGCTTTT CTTCTTTAAA TTCTGTTTAA
62451 ATTTTGTTTT TGAAATTTAG ATTTTCAAGT TATTTATATA CCTTCATTAC
62501 AATACTATAG GATTTTATAA TATTCTAAAT ATTGACCTTT ACCATAGAGT
62551 TTCATATTTT GTGGTTTTGT GTTGCTATTT ATCATCCTTT TGTTTCTCCT 
62601 TTTAGCCTTT CTTGTAGGGC CGGTCTAGTG GTGATAAGCT GTATCAGCTT 
62651 TTGTTTGTCA GGGACAGTCT TAATTTCTCC TTTTTTGAAG GGCAGTTTTG
62701 CCCATACAGT ATTTTTGTTT GGCAGTTTTT TTAAGTTTCA AAACATAGAA
62751 TATAACATTC CATTTCCTTC TAACCTGCAA GATTTCCATT GAGAAATGCA 
62801 CTCAATGGAT TTTTTAATCC ATTGAGATAA TTTTTTAATC CTGTAGGATT
62851 TAAAATTTTT AGTCTTACAG GATTAAAAAA TTAAAAAGTT AAACTTGTTA
62901 TATAACATAT TAACATGTAT TTTATACTTA AAGTATCTTA TGTTTAAAAA 
62951 GTTGATTATC ATATATATTT TATACAGTTT CTCCTAATTA TTGCCTTCTA 
63001 ATGAAATACA GGGACCTAGA GTAACAGGGA TAAAGTATGG CCTTTTGATC 
63051 AGCACGCCTG GTTCTGAGTC CTTCTTAAAA AAACTCTGGG CCTGGTGTGG 
63101 TGGCTCATGC CTATAATCTC AGCACTTTGG GAGGCCGAGG CGGGCGGATC 
63151 ACCTGAGGTC AGGAGTTTGA GATCAGCCTT GCCAGCATGG TGAAACCCTG 
63201 TCTCTACTAA CAGTACAAAG ATTAGCTGGG CGTGGTGGTG GGTGCCTGTA 
63251 ATCCAAGCTA CTCAGGAGGC TGAGGCAGAA GAATCGTTTG AACCTGGGAG 
63301 GCAGAGATTG GGCCACTGCA CTACAGCCTG GGTGACAAGA GCGAGACTCC 
63351 ATCTCAAAAA AACAAACAAA AACTCCGCTG AGATGAATTT TTCTCATTTC 
63401 TAAAATCAGA ATAATAGATT TATGTAAGAG TTTCTGTAAG GCTCAAATGA 
63451 AATATATGTA ACGTGTAAAA TGAGATACAA TTAGTAGAAT TATATTATTT 
63501 TATTAATACT CACCATAAGA GGTGTTCTTT AGATCCTGCA GCGTTTGCTG 
63551 CGCAGTTCAC GTTTGTTTAG AAGAATGTCA GTAACCGGTG CAAACCTCAT
63601 GTGTTCCGCA CCCCCAGTGG CCTCCCACCT CTCCACAGAG TCACCGCCTC 
63651 CTGCAGTGCC TGCTGCTTCT GCAAATGCGT GGCCTCATCC TGCAGAAACG 
63701 GGGCTTCTCA TGAGGTTGAG AATAGCTGTG AAAATGTTTA CGTTGAAGTT 
63751 GTAGAGTTCG TTAATTATTT TCTTCTTTAT TTCTCTGGCA GCTCTTGAAG
63801 TCCTATTGGC TGGATAATTT TGCAAAAGAT TCTGTAAACC CTGGAGTCAT
63851 GGTGTTGCTG GGATGCGGTG CCTTATCCAG CACCTGTGGT CAGCTGGCCA 
63901 GCTACCCATT GGCTTTGGTG AGAACTCGCA TGCAGGCTCA AGGTGAATTT 
63951 TTGATTACAG AACCACACCG ATAAAAGTGC TGCACCAGTA ATGTGCTTTT 
64001 AGAACTCCAA GTTCTACTAA GATGCAGACT GTAGTTTTAA GACAGTATTT 
64051 CTCAACCTTT TTTTCATTAT TGCCTCCTTA AGGAATCTTT TCAGAAATTC 
64101 TTTTTCTAAA TGCTCCCTCG TCATGAAATT TTAATGCGAC AGAAGCATTG
64151 CATATGTACT GTATGCATAC ATATGCCTTA TAGATAAACA GAGTACTATT 
64201 TTTTTTGACT GTGTTACATG CACGTTTTAA GATTATAAGC TTTAGTATCT
64251 GATGGATTTG GGTTCAGATC CTTGCCTCAG ACTTCTTGGG GTTTTTAATG 
64301 GGAATGAAAA TTGTACAGTG TTGTAAGAAT TACCAACAAT ATAAATAAAG 
64351 CATCTTGGGT TTGTTAAATT TTTGGTAAAT GGTGGTTGGA ATCATTTTTT
64401 AGTGTTGCGT AGACCCTACA AGTTTTGAGC TGTGATTCCT CCTCACTGTG 
64451 ACACTGTCTC CATTGTTGGC TTTGATTACA CTGTACCATC CTGGTTGTTC 
64501 TGCCAGCCCA TTGATAACTT TTACCATTTG CTGGCTTTTA TTGCTATCCC 
64551 CACTCTATTA AAGTATGCAT TCAAATGCCT TTCTTTTCTC TTTGATGCTT 
64601 TCCCTGGTCA GTCTTATCCA TTGTTTTCTT AAGTAGTACA CCTTGGGCAT 
64651 CTACAGCTCT ATTCCCAACC TCCCTTCCAA GTGCCAGCCA CAGCAACCCC 
64701 AGCCAAGCAG TCAGTAACTA ATTGGCAAAT ACTCCCTGAG CCATTGTCCC 
64751 ATTCTAGACA CTGCCAGATG CTAGGGGTAG AGCAGTCAAC AAGTCAGGTG 
64801 TGGCCCCGCC AGTGTAGAGT AGAGAAGACG TTATGTCCAG CAAGTAAACA 
64851 ACCTGGTTAA ACCAACTCCT CTTTTGTTAG GGGAGCACAG AGCAAGGAGC
64901 TATAACCTAA CTTGGGCGCT GCAGAATGCT GTCAGTGAAG CTGAGACTGG 
64951 AAAGATGAGT GGGAGTTAGC TGGGCACAGG CCAGTGGAGT GGGAACAGAA 
65001 AACATTCCAG TTGAGGGAAA GCATGTGTGA AGACACTGAG GCAGGCACCA 
65051 ACATGGTGTA TTTAAGGAGC TGAGAGACAG TCATGGCTGT AGAGAAAAAC 
65101 ACAAAGTAGT GAACTACACG TTTCTTGTGT ATTCTCTCAT TTCACCATCA 
65151 TAACCATCTT GGGGATGGGA ATACTAACAT TATCCCCATT TTTCAGATGA 
65201 GCAACTGGGG CAGAGAGAAT TTAAGTAACT CCCACAAGAT TATACCTGTG 
65251 GTAAATAGTG GGACTGAAAT TCAGACACAT GCAGTCTGAT TCTAACCCTC 
65301 CTGTCTGCCA GCTCTGATCC AGAACTTTGC ATGACTGATA CGGCTGATAG 
65351 ATTGTCTATG GCTGATAGAC TGTCATTTCT GACCTAAAAG TCTGATCATT 
65401 TTACATCTGT TCAGACATCT TTGCAGCCTT TCGGTGTCAG TTCCAAAGTT 
65451 GTTAGTGGGA ATTTCAAAGC CTTTAATAAT CTAGCCCCAC TTTGTTCACT
65501 CTCTGTGTAA TAACCACATA CAACAATTGG CTGCATCTCC ATAGCACATG 
65551 GTACTCCTCC CGTTGTCTTG GTTGTGCCAG CAACACTGGT TTTCGCTTTC 
65601 TCTTCCTGCT TGTTGAGGTC ATTTCCAAGG CCCAGGTCTT TGTGCTTTTT
65651 CCCAAGCTTC CCAGAGCTTC TTCCATACTC CCCTTACTTC CTGAGATTTA 
65701 ACTGTTCTCT CTTCAGCGCT TGTCTAGTAA GAAGGAGGCA GCAGCAGCAC 
65751 TGTGGGGTGG TGGAAAGTGT ACCAGCTTTG GAGTCAGACC ATTGGATCTC 
65801 AGCCCTACCA TTTTCTACTT AGATTTTTTT AGGACAAATT TCTCCATCTT
65851 TCTAAGCCTC CAATTGCTCA CTTACAAAAT TGATATAACA TTTACCTTGC 
65901 AAGATTGGTA TGGAAGGTAA TTAACCCAGT ATTTAGAACA TAGTAATTAA 
65951 TAAATAACTA TTATTACCAT CATTACTATA GTTAGGACAC TCACTGTTAG 
66001 GTGCTATACA AAGAGGATCA TAAAAGGGAT GTTGTCTTGG GCTTCTTGGA 
66051 ATAAATGTTG TCCTTTTACT GTATTTTAGA ATATCATTCT GGGTCATAAT 
66101 TGTTTGTTGT CATAATAATG AAACATACTT GAATATTAAA TTACCCTCTT 
66151 TTTTTATTTT TTAGCCATGT TAGAAGGTTC CCCACAGCTG AATATGGTTG 
66201 GCCTCTTTCG ACGAATTATT TCCAAAGAAG GAATACCAGG ACTTTACAGA 
66251 GGCATCACCC CAAACTTCAT GAAGGTGCTC CCTGCTGTAG GCATCAGTTA 
66301 TGTGGTTTAT GAAAATATGA AGCAAACTTT AGGAGTAACC CAGAAATGAT 
66351 GTTGCATTTT TTGCTTTAGC CTGATAATTG AAACTTTCAA CAATCTCTGG 
66401 AGTGACTTTT TCTCCTCGAA TTGAAACAAG TCTATGGCAA AAGAAGCTGC 
66451 ATTTTTTTCA CAAAAGGGAA GATGGTAACA ATGGTCACTT CAAACTTTTG
66501 GGCTAAATTA TATGTACACA GAAATGTTCA AAATCATAGT TTTAATGTGT 
66551 TTTGAAAAGG CCACACAATT ATACTTTATC TTTTCTTAAT AATCCTGCAA 
66601 ATCTCTGCCC TGAATCCGAA ATCTGAAAAT GTACTGGCTT GAACAAAATT 
66651 TGTTTTGTGT GTTAGAGTTA TAAATCATTA ATCTTTATTT CGGGTGGTTT 




FIGURE 3W 




Docket No.: CL001103 
Serial No.: 09/777,921 
Inventors: Gennady MERKULOV et al. 
Title: ISOLATED HUMAN TRANSPORTER ... 



66701 ACGTTTATGC CAGTTCCTTT ATATTTAAAT TTCI ICjI I I I ATATATTTTG 
66751 AATGTCTTTA TAGAI I I C_ I I TAAATTTCCT TATAGAACCA TTAATAGAAA 
66801 ATCATTACAT TTAAAATATA CCTTACAGCA AAAGCATCCA AATAAGTATA 
66851 GGGTTTATGT CCTTATTTTT CTTTCAGCTG AATACGAATG AGCACAGTGG
66901 TGGAATTTCT GAAGGGAAGT GATGAAATTA TATTTATTTC AGTGGGCACT 
66951 TTTCCATTTT ACCACTGTAC CATTATTTGG TTCCTGGAGT TATACACTAA 
67001 TTTTCAGTAT ATTACTGTTA AATTACCAAC ACAAGGCAAT TTATTTGAAA 
67051 GATTCCGTTT ATCCTGCCAT TGCTTTGAAA AGCAGCAGGA AACGAAATCC 
67101 TTTGACTTGT ATCAGCTTCT GCAGAGCATC III Ca III ICC TTTGTCCTTT 
67151 GTTTCCTACC TTTTGAATCA GATTCCGTTT TAGTCAGGAA GACTTCTTGG 

67201 GACCATTCTT AGTAACCTGA AAI I I < I I I I TTAATTGCAT GAAGTGGATT 

67251 GATCATGAGC AAATGATGTG CTTATTTCTC CCTCACTGTT GAATATCTTT
67301 GAACTTGCTG TTTTCAATAT GGGCAGCACA AAGGTGAGAG ATACATATTA 
67351 ATAGTAGTAT GTATTACTCT TATACATTAG ATACCTATAT TTAAATGAAA 
67401 GGCCCAATTT GTAAACATAT ACATTCATAT TCTCTCTTGC CCCAAGTTTT 
67451 AGGAACATGT TAGGATATAG GAGACTTAAT TTATAATAAT GAGAGCATTT 
67501 TTTTATTTTA CTAAAGCCAT TTTTATAGTC AACTATCTTT TCTTATTTGT 
67551 GTGATTAGAA CTTAGAAAAA TATTTACTAG TTGAAGTTAT TATCAGTTTT 
67601 TAATTTAGTT CTTAAACTCA TTTCACTTCT AATAATTTCT GTTATAAATT 
67651 GCCAGCATTT TAATGAAAAT CTAATGATGT AATAGGCATT TTCTTTATTT 
67701 GAACCTACCT CTTTTATTTT CTGAACCAAA GAGAAAGATG GACTGGTGTT 
67751 TGTGAAACAT TTTTAAAAAT GTAGTTTCAT TTATATTAGT TATGTTTGAT 
67801 AAATGTCTCA GTATTTTTAT AATATGATAA GCCTGGGATT CTACTTTTAG
67851 GGTTATTTGT ACTTTTGAGT AATATATAAA GTGACAATAT TAAGGTACAT 
67901 GATCAGCTCT TTCTATTTTT ACTCGTAAAA ATTATGGAAA TGAATAATTT
67951 TGCTAACAAC TTTGAAATTT CAAACTTCTG GAAAATATGA AAATATTCAT 
68001 TGTTCATTAT GAATTTAAAT TGTAAGGTAT GAATGTGATT TGTCTGTACA 
68051 TCTTGTATCT TTTCCAAAAA ATGATTCTGT ATCTTTTGGA AAAAAGCCGA 
68101 GAGTTGAAGA TAGTATATTT CTGGTAGTAC TGAATATTTA CTTACAGTTT 
68151 CTATCAAAAA TATATATTTG TTTCTAAAAT TALI IGI I I I CCAGTTTTTA 
68201 TTTTTTTTAG AGAAAATTCT TAAGTCTCAG TTTCCTAATT GAAAAAAAAA
68251 AATTATAAAT AAAGCAAAAA TTGTATCCTA CAGCTTAGCT AGCTTAGATG 
68301 TTTGGCACCA GTTTGAATCA TGCTTTTTAC AGCTGGCTCC ATGTAGTCTT 
68351 TCCAAACATT TTGGCCTTTC CTGAGCAGCC CTTGTAGATA TTGTCTGTAT 
68401 GATGCATTTT GACACAAGGT GATATTTTTT GTGATATCAA AATTCCACAT
68451 TTACCCATTA GAGTTACAGC CCTGGGGTTC ACAGTACCAA GGGGGACCCA 
68501 GAGCCTCAGG ATTGGCCAGG CTCATTTTGC CGTGGAGTAT CAGTTTGTCT 
68551 TGAAATTGTG GGAAAAAATT CTAAGTTGAA TTCACTGGTA AGTAATTTTT
68601 TAAAATTTCA TAATGCAGAT TACATCCAAA ATTTGATTTA AAAATTAAAA 
68651 CATAAGACTG CAGAGAAATT CTGCATTTCA ACTCCAATAC TATCCAGACT 
68701 TCAGAAATAA CTTATCAGTT ATTTCTGTAA GCTTCTTGCT TACCTGGATA 
68751 CCTGACAGGT GAGATGGCTG TAGCAGACAC TGGCAGTTCC CTGCCCACAC 
68801 ACCTGTCCCT GTCCACAGCT GCACAAGGCA GCTCTGTGTG CAATTGCCAG 
68851 CATCTGCTCC TCTGTTCTCA GGGAATCTTT GTTAGAAAAA TGCTGCCATA 
68901 TTTGTTTCTC ACCTATTAGT CTTGTCTCCC AGTCAAGAGA ATAAATTTAT 
68951 GCAAGCAGAG ATTGTACTTT ACAGTATTTT GTCTTTGAGC TTGGCATTAG 
69001 GTTGCATTTG TAAAAATGTG GCATGGCTTC CTCATCCCCC AATAGGAACT 
69051 TTGCCAGCCC TTTTGTTCTC ATGGAACTTC (.Mill IGAA AAGAGCACCA 
69101 AAGGAGTAAA AATACTGTGG AGGGAGCAAC CCTCCTTTGC CATATGCTCT 
69151 CATTGGGAGA CATGTGGAGC AGTCTGAAGT CATTTAGGCC ACTCTCTGGG 
69201 AGAGCACATC CTATGATGTT CTCCCAGCCT AGCCCCTTCC ACTGTGCTCA 
69251 AGTCCAAGCT GACCAGCTTT CTGACCACAG TGTAAACAAA GATGATTGTC 
69301 AGTGGGCCCC AGAATCCTAT ACCCAGA (SEQ ID NO: 3) 
TTGCCCACGCAGATGGCTGTTGATCTTTTCTGCMCAMTCCAGGAG I I I I IG

TTTTATMTTGCTCCAATAGATGCTrTAGGATTTMCTCTCTGL I I I I I AAAGCAGAATC 

GCCATCCCAGGTGTGCMCCACGAAAAMTTAGACATCCGTGAGAGACMTGCCCTCCAT 

GGCCCAGTTTCCAGGCAGAGAGMGC^GCrCTGGGCrGACCGCCAAGGCTCCGGCCCGAG 

AGGGTCTTTMGTGGAGTMCCAGTCTTCMGACCCCGCTCCCMGCCACCGACGCG 

[G,C,A]

CGCTGCAGCCCTGGACCTGCTGGGGGCCTGTCCTCGGACCCGCATGCT 
TGGCAACTGGGCAGAGGTCGACCCCGGGTC^^ 

GCTCCCTCACTTCCGGCTCTCTGGAGGCGGGCCCGGCCAGTGCCGCCGAGGCCAGCGCGG 
CGAGCTCCTCCCCAGCAGCGGCGGGACGGCCACACCCTGCGCGCCGCGCGGGCTCGGGTG 
GGGTCTCCGCTCCTGCGCCCTGCGCGCCGCAGCCGCACCCCCGACGGCGCCCC^MCGCT 



AGTTTCTCL I I I I IGI I I I ATMTTGCTCCMTAGATGCTTTAGGATTTMCTCTCTGCT 
TTTTAAAGCAGMTCGCCATCCCAGGTGTGCMCCACGAAAAMT^^ 
GACMTGCCCTCCATGGCCCAGTTTCCAGGCAGAGAGMGCAGCTCTGG 
AGGCTCCGGCCCGAGAGGGTCTTTMGTGG^ 

GCCACCGACGCGCTGACGCTGCAGCCCTGGACCTGCTGGGGGCCTCTTCCTCGGACCCGC 
[C,G,A]

TGCTGAC^GCGGGACrGGCAACTGGGCAGAGGTCGACCCCGGGTCCGCACAGCACCTCCC 

GAGACCCAGCTCCCAGCTCCCTCACTTCCGGCTCTCTGGAGGCGGGCCCGGCC^ 

CCGAGGCC^GCGCGGCGAGCTCCTCCCCAGC^GCGGCGGGACGGCCACACCCTGCGCGCC 

GCGCGGGCTCGGGTGGGGTCTCCGCTCCTGCGCCCTGCGCGCCGCAGCCGCACCCCCGAC 

GGCGCCCCAMCGCTGTTGCGCCGCGCGCCCCGCCCAGCCCGGCCTCGCGCrGGTCCCGG 

TCGCCATCCCAGGTGTGCMCCACGAAAAMTTAGACATCCGTGAGAGACMTGCCCT 
ATGGCCCAGTTTCCAGGCAGAGAGMGCAGCTCTGGGCTGACCGCCAAGGCT 
AGAGGCTCTITMCTGGACTMCCACTCTTCM 
TGACGCTGCAGCCCTGGACCTGCTGGGGGCCTC^ 

GACTGGCAACTGGGCAGAGGTCGACCCCGGGTCCGCACAGCACCTCCCGAGACCCAGCTC 
[C,G] 

CAGCTCCCTCACTTCCGGCTCrCTGGAGGCGGGCCCGGCCAGTGCCGCCGAGGCCAGCGC 
GGCGAGCTCCTCCCCAGCAGCGGCGGGACGGCCACACCCTCCGCGCCGCGCGGGCTCGGG 
TGGQJTCTCCGCTCCTGCGCCCTGCGCGCCGCAGCCGCACCCCCGACGGCGCCCCAAACG 
CTGTTGCGCCGCGCGCCCCGCCCAGCCCGGCCTCGCGCTGGTCCCGGTCTCGCCCCGCAG 
CCCTCGATCTCCCGTGACTTCCTCGGCCAGGCCGCCTGCGCCTCTGGG^ 

CMCCACGAAAAMTTAGACATCCGTGAGAG^ 

CAGAGAGAAGCAGCTCTGGGCTGACCGCCMGGCTCCGGCCCGAGAGGOTCTTTA 

AGTMCCAGTCTTCMGACCCCGCTCCCMGCCACCGACGCGCTGACGCTGCAGCCCTGG 

ACCTGCTGGGGGCCTCTTCCTCGGACCCGCATGCTGAC^GCGGGACTGGCMCT 

AGGTCGACCCCGGGTCCGCACAGCACCTCCCGAGACCCAGCTCCCAGCTCCCTCACTTCC 

[T,G] 

GCTCTCTGGAGGCGGGCCCGGCCAGTGCCGCCGAGGCCAGCGCGGCGAGCTCCTCCCCAG 

CAGCGGCGGGACGGCCACACCCTGCGCGCCGCGCGGGCTCGGGTXXXXn'CT 

CGCCCTGCGCGCCGCAGCCGCACCCCCGACGGCGCCCCAMCGCTGTTGCGCCGCGCGCC 

CCGCCCAGCCCGGCCTCGCGCTGGTCCCGGTCTCGCCCCGCAGCCCTCGATCTCCCGTGA 

CTTCCTCGGCCAGGCCGCCTGCGCCTCTGGGACCATGTTGCGCTGGCTGCGGG^ 

CAAGGCTCCGGCCCGAGAGGGTOTTMCT 

MGCCACCGACGCGCTGACGCTGC^GCCCTGGACCTGCTGGGGGCCTCrrCCTCGGACCC 
GCATGCTGA(^GCGGGACTGGOV\CrGGGCAG^GGTCGACCCCGGGTCCGCA(^GO\CCr
CCCGAGACCCAGCrCCCAGCrCCCTG^CTTCCGGCrCTCTGGAGGCGGGCCCGGCCAGTG 
CCGCCGAGGCCAGCGCGGCGAGCTCCTCCCCAGCAGCGGCGGGACGGCCACACCCTGCGC 
[G,T] 

CCGCGCGGGCTCGGGTGGGGTCTCCGCTCCTGCGCCCTGCGCGCCGCAGCCGCACCCCCG 

ACGGCGCCCCAMCGCTGTTGCGCCGCGCGCCCCGCCCAGCCCGGCCTCGCGCTGGTCCC 

GGTCTCCK:CCCGCAGCCCTCGATCrCCCGTGACTTCCTCGGC(^GGCCGCCrGCGC 

QGGACCATGTTGCGCTGGCTGCGGGAGTCGTGCTGCCCACCGCGGCCTGC 

GAGCAGCCGACGCGCTACGAGACCCTCTTC^ 

GCCACCGACGCGCTGACGCTGC^GCCCTGGACCTGCrQQGGGCCTCTTCCT 

ATGCTGACAGCGGGACTGGCMCTGGGCAGAGGTCGACCCCGGGTCCGCACAGCACCTCC 

CGAGACCCAGCTCCCAGCTCCCTCACrrCCGGCrCTCTGGAGGCGGGCCCGGCC^ 

GCCGAGGCCAGCGCGGCGAGCTCCTCCCCAGCAGCGGCGGGACGGCCACACCCTGCGCGC 

CGCGCGGGCTCGGGTGGGGTCTCCGCTC^ 

[A,C] 

GGCGCCCCAMCGCTGTTGCGCCGCGCGCCCCGCCCAGCCCGGCCTCGCGCTGGTCCCGG 
TCTCGCCCCGCAGCCCTCGATCTCCCGTGACTTCCTCGGCCAGGCCGCCTGCGCCTCT 
GACCATGTTGCGCTGGCTGCGGGACrrOT 
GCAGCCGACGCGCTACGAGACCCTC^^ 

GGACATCGGCGAGCTGCAGGAGGGGCTCAGGMCCTGGGCATCCCTCTGGGCC^ 
TGGGGCCGCGACCGGCGACCCCGCTTMCAGMCT 

ATTTCTCCAGATAAMTGACTGTTCTG I I I I I 

MGACACTTTTGTCCTGAATCCATCCCAGGI IU I IGI I I ICTGI I I I AATACCTTGCAG 
ACATGTAATCCG I I I I AGCTGTCAGACTTCAGTGGGTCCCAAG I I I I GTATAAAGGCGCA 
CACATTCGATCTCTTTCGAAGCTGCI I IGI I ACAGCAGCTATGTGTATTGTCTACTGTTT 
[C,G]

AAMCTGTTTGAAAACCMTCGCGTGTTC^ 

ATTCCATTGTTTMGACATTCCTAGGTTMTGCCCrAQCT 

TTGACTTGACCTGCGACTGAGCAATTTCATTTTCTCTGAGTCATCT^ 

MCTTCTGCCCCTTTAGTAGGGTGGAGATATGTGGMCTTCTCC^ 

TCCCTGACACTGGCATTCTCTTATCCAMGAGGGAMGrGATTAGGTTACTATG^ 

GCTGATTGTCCCAGAMTGGCCCAGTTGGAGTTCCCC^CCATOTCCAATC^ 
AGCAGCCCAGGAMGGGACGACCTTGCTGCAGTGCATCAGCAGATGCCAGGGTT^ 
TAGAGAGTGGMGTCMCTGTGTTCCTCACAGTAGGTGCCTTTGMGGGAG^ 
GTACAACTCCATGGTCCCTACMTATACAAMGCTC I I I I IAA 

GATrGTAMGGGATCCTGAGATCAAAMGCrrGAGMTTGCTGCTGTATCACCA I I I I I A 

[C,T] 

GTMCTG^TCATATTCTGTTATATGrrTGTGTCATAGTATATGTTACCAATTCI I I I I A 

MTCACCITTTACTTTATTGATAGTTTAAAMCGATTGTMCT 

CCTTTGT ATTCATTTTCTCATTCTGGTCCAGTTACTTTCGTAGG^ 

GGACATTGCTGAGTCTGAAGGTMCAOXCATTTTAMCTGGGATACOT 

AAACCTT AGACCCATTTTCACTCTTTTGACTGACAGTGCTTGCTTCTCC^ 

GMGGGAGATCTCAGTGCTACMCTCCATGGTCCCTACAATATACAAM 
TGCTCAATGAI I I I I MGATTGTAMGGGATCCTGAGATCAAAMGCTTGAGAATTGCTG 
CTGTATCACCA I I I I I ACGTMCTGCATCATATTCTGrrATATGTTTGTGTCATAGTATA 
TGTTACCAATTL I I I I I AMTCACCTITTACTTTATTGATAGTTTAAAAACGATTGTAAG 
TGAMTTGOV\TGGATGTCCTTTGTATTC^TTTTCT 
[G,A] 

G7VTAMTTTTGAGGAGTGGACATT«T^ 

TACGTATTGCCrrTCGGAMCCTTAGACCCATTTTCACTCTTTTGA 

TTCTCCACATCCTCGCTCATTCAGGGTATCAGTCTTTGTAMGTCTCCT 

GAMTTCCTTTTCATTTCCTCTrcrrACT 

ACAGGGTMTTTATAMGAAMGACATTTATTTAGCTCA 
CAGTTACTTTCGTAGGATAMTTTTGAGGAG^^
CATTTTAMCTGGGATACGTATTGCCT 

ACTGACAGTGCrTGCrrCTCCACATCCrCGCrCATTCAGGGTATCAGrC 
TCCTATTCTGCAGGrGAAATTCO 



TTGTAAAGTC 
TTCATTTCCrGTCrTAGrCCATTTAGTGTTGCTAT 



ACTGGAATATCTGAGA(^GGGrMTTTATAMGAAMGA(^TTTATTTAGCTCACAGTTC 
[C,T] 

GCAGGCTGGGAAGTTTMGMGCGTGGTGCTGGCATCTGCTGGACT 
TCCTGCTGrGrCACMCATGGrGGAMGrCAMGTGGAAGTGGACATGrGTGMGAAGCA 
AMTCCGAGGGGTCTCCrGGCTITATAGCMCCCAGCCrCGAGGGMCrGATCCATTACr 
GAGGGAACTMTTC^GTCTCATGAGAGAGAGMCTCAC^^ 

AAGCG^TTCATGAGGGATCTGCCTCCGTMCCCTGACACCTCCrGCTAGGTCCCrCCrCC 



CATTTAGTGTTGCTATAGTGGMTATCrGAGACAG^^ 

ATTTAGCrCACAGTTCCGG^GGCrGGGMGTITMGAAGCCTGGrGCrGGCATCT 

ACrCCroQGGAGGGCTTTCCTGCTXnXjTCACMCATGGrGGAAACT 

A^TCTGrCW^GMGCAAMTCCGAGGGGTCTCCrCKKTTTATAGCAACCCAGCCrCGAG 

GGMCTGATCCATTACrGAGGGMCrMTTCACTCTCATGAGAGAGAGMCTCACrCACr 

[A,G] 

CrGCMGMTGAC^CCMGCCATT^TGAGGGATCTGCCrCCCTMCCCTGA^CCrca" 
GCTAQGTCCCTCCTCCCMCACGGCCAGVrCAGGGATCAGACrTCAAGVTGAtj I I I I IGT 
GGGGACAMCAAAACGrAGCACTTGCTTTGCCI I I I GCTTCTATTCACATCCTCCAG^GG 
ATTGCATTATGCCTACCCATTTGGrGAGGGG\Gr(_ I I CI I I AATTGGTTTACTGATTCAA 
ATGCTACCCTCCTCCAGAGAG^TCCTCAGVGACACACCCAGAAATCATG I I I IACCAGTT 

TTCCTGCTGTGTCAG\ACATGGTQGAAAGTCAMGTQGMGTQGACATGTCT^ 
AAAATCCGAGGGCTGTCCTGGCTrTA^^ 

TGAGGGAACTMTTCACTCT^TGAGAC^GAGMCrCACrCACTACTGCAAGAATGACAC 
CMGCCATTG^TGAGQGATCTGCCrCCGTMCCCTGA^CCTCCrGCTAGGTCCCTCCTC 
CCAACACGGCCACATCAGGGATCAGACTTCAACATGAG I I I I I GTGGGGACAAACAAAAC 
[G,A] 

TAGCACrrGCnTGCCrrTTGGTTCTATTCACATCCTCCACAGGATTGCATTA^ 
CCATTTGGTGAGGGCAGTC I ILI I I AATTGGTTTACTGATTCAAATGCTACCCTCCTCCA 
GAGACATCCrCACAGACAG\CCG\GAAATCAT(3 1 I I I ACCAGTTATCTGGGCATCCCTTA 
GTCCAGACGAGTTGATACATAAMTTMCCATCACACATGGGATAGM 
AGTG\ACCTITATGGGAGAAMTTTCAGAGGCATGTCAGGGGTTTATGTM 

TGTTTATTGCATTGACTGGMTCAGGAT^ 

AGAGGGTTCATTTCA I I I I I ATTTCATTAATATTGC I 1 1 I I I I I I I I I I I I ICTGGAGAC 

AGMTCTTGCTCTATCACCMGGCTGGAGTGCAGTGGT^ 

TCTGCTTCCTGGATTCMGCGATTCriXnXKICrCAGCCTCCCM 

GCACATGCCACC^CACCTGGTTMCTTTTGTATTTTCTAGTAGAGATGGGA^ 

[T,G] 

TTGGTCAGGCTGGTCTTGMTTCCrGGCCrCrAGTGATCrGCCrGCCT 

GTGCTAAGATTACAGGCATGAGCrACCATGGCCAGCCCA^ 

GVGACATGTTATGGITTCTGGCACAATATTMGMGACATGA^ 

ATTTTAGGGCATCAO\AO\GAMGATTATCCT 

ATrrCTGTGWVTGTTCTAAMTATATAAMTCrGTATCrrTTGTC^ 



TTATTTCATTAATA I IGU I I I 1 1 I I I I I I I I I I CTGGAGACAGMTCTTGCrCTATCAC 

CMGGCTGGAGTGCAGTGGTGCGATCrCGGCrCACTGCAGCCrCTGCrrCCTGGAT^ 

GCGATTOTGTGCCrCAGCCrCCCMGCAGCTGAGATTACAGGCACATGCCACCACACCr 

GGTTMGTTTCTATTTTCTAGTAGAGATGGGATTTTGCCATGT^ 

GMTTCCTGGCCTCrAGTGATCrGCCTGCCrcrGCCTCrGAMGTGCrMGATTACAGGC 

[T,G,A] 

TGAGCTACCATCGCCAGCCCATTTCC^ 
TGGCACAATATTAAGAAGACATGATATGAMTCA^ 
AGAAAGATTATGCTATAAGAAAMCAATGGMTTCCAACTACATTTCT 
AAAATATATAAAATCTGTATL I I I I CTGTTCTCTCCTGATTTATATTCTAMTTTGATCT 
TATCCTTCTCTGCAGAMTAMGTGTCTGAA (SEQ ID NO: 48)

6919 ATGAMTCACAGGGTGMTTmGGGCATCACMC^^ 

MTGGMTTCCMCrACATTTarGTOW\TGTTCrAAMTATATAAM I I I 

CTCTTCTCTCCTGATTTATATTCTAAATTTGAT^ 

TCTGAMGAATGAAAAAMTGGMGMTTCTITAGTMGCTATAAMTACCCXrTCTATC 

TTTGTAGCATTCTMGCCTTTTGTCACC^ 

[G,C]

TAGGCCACAGCCATGTACATTGATCCCTTTA 1 1 I I CI I CTCTCTGCCTGAGATTTCTCTC 
ATTCCCCCrrCTCTGCCTGGTATATGATTGCCCATTGriTM 

TAATCTTCCT AGCCCAL I I I CI I I ATCGGTATTCCAGAAAAMCAAAAGAAGCTTCCACA 
AGACAACATTCTCTMTACACTGCTTAAL I I LI I I I GACCCTGCTGAGTTCAAAAATCTT 
ATLI I I I IMGGATT(^TGGAGTC(^CCMGGTATCrATATTTGACAGGATTTATGAAA (SEQ ID NO: 49) 

7305 GATTGCCCATTGrrTMQGCCCO^ACTCACCrrTATMTaTCCTAGCCCAL I I I C I I IA 

TCGCTA TTCCAG AAAAMCAAAAGMGCTTC^ 

TAALI I LI I I I GACCCTGCTGAGTTCAAAAATCTTATC I I I I I AAGGATTGAATGGAGTC 
CACCMGGTATCTATATTTGACAGGATTTATGAAMCAAAAGGA 1 I IGI IGAGAAAGTTT 
GAAGCCTMCrCTGAMCOTGGATCATAGTGTTTACrACVVCATTAAL I G I I I IAGTGGAT 
[G,A] 

TAATAGTTATTATTATAQGCrGTGGMTCAGAACAGQGTTCAMTGTTT^ 
TAGACTGTGGCCTTGGGCATGTTATTTMTGCCrGGAGGCCTCAMTGTTM 
QGTMGACCTACCCAGTMCTTAGCATAAATAGTAMTTCATTCATT^ I I I ICAAA 
CAGTGCCAGACATTGTTTMTGMCTGGGGATATAG^ 

TTCATTGTATTCTCAAMCCCTCCCTAT^^ (SEQ ID NO: 50) 

7340 TAATCTTCCTAGCCCAL I I I LI I I ATCGGTATTCCAGAAAAMCAAMGAAGCTTCCACA 
AGACMCATTCTGTMTACACTGCTTAAL I I LI I I I GACCCTGCTGAGTTCAAAAATCTT 
ATLI I I 1 1 MGGATTGMTGGACTCCACCMC^ATCTATATTTGACAGGATTTATCW^ 
ACAAAAGGAI I IG I I GAGAMGTTTGMGCCTMCTCTCW\ACGTGGATCATACTGTTTA 
CTACACATTAACTG I I I I ACTGGATCTMTAGTTATTATTATAGGCTCTGGAATCAGAAC 
[A,G] 

GGGTTCAAATGI I I ICACCGCTTGCTAGACTCTC^KZCTTGGGCATGTTATTTMTGCCTG 
GAGGCCTCAAATGTTAACT AGGAATGCTMGACCTACCCACTMCTTAGCATAAATAGTA 
AATTCATTCATTTAATG I I I I CAAACACTGCCAGACATTGTTTMTC^CTGGGGATATA 
GTGGTGAACAACACTGACAGCG I I LI ICATTCTATTCTCAAAACCCTCCCTATAGTAAGT 
AGGTCTGTGTGTGTGTGTAGGTGCATGGGGMTA^ (SEQ ID NO: 51) 

7466 TTAAGGATTGMTGGAGTCCACCAAGGTATCTATA^ 

GGAI I IG I I GAGAMGTTTGAAGCCTAACTCTGAAACGTGGATCATAG I G I I IACTACAC 
ATTAACTG I I I I ACTGGATCTMTAGTTATTATTATAGGCTCTGGAATCAGMCAGGGTT 
CAAATGI I I I CACCGCTTGCTAGACTCTGGCCrTGGGCATGTTATTTMTGCCTQ 
CTCAMTGTTMCTAGGMTGGTMGACCTACCCAGTAACTT AGCATAAATAGTAAATTC 
[A,G] 

TTCATTTMTGTTTTCAMCAGTGCCAGACATrGTTT^ 

AACAACACTGACAGCG I I LI I CATTGTATTCTCAAAACCCTCCCTATAGTAAGTAGGTCT 

GTGTGTGTGTGTA(5GTGCATGGGGMTAAAAMTMTM 

TTCAAAMGCAGAAAGAGCTATTCMCAAMCTACCTGCCTTTTATTAG^ 

AACTCTATGG I I IGI ICTCTCCTGTCAATTCTGTTAMTGCTCTCAGCCTGTrTTCCTTA (SEQ ID NO: 52) 

7589 MCTGTTTTAGTGGATGTMTAGTTATTATW 

ATGTmCACCGCTTGCTAGACTGTGGCCTTGGGCATG^ 

AMTGTTAACTAGGMTGGTMGACCTACCCAGTM 

CATTTMTGTTTTCAAACAGTGCCAGACATTGTT^ 

CAACACTGACAGCG I ILI I CATTGTATrCTCAAAACCCTCCCTATAGTAAGTAGGTCTGT 
[G,C] 

TGTGTGTGTAGGTGCATGGGGMTAAAAMTMTM^ 

AAAMGXIAGAMG^GCTATTCMCAAAACTACCTGCL I I 1 1 ATTAGATGAAACTCTCAAC 
TCTATGG I I Id I I CrCTCCrGTCMTTCTGrrAMTOTOTCAGCCTGrnTCCTTATCA

CCCTGGCCACGACrrCTGrCTTTTCTGCrrGGTCCrGTAGACTCT 

CTCTGCCTGGCTATCTGCCrrCT^ 

CTGGGGATATAGTGGTGAACAACACTGACAGCCj I I LI I CATTGTATTCrCAAAACCCTCC 

CrATAGrMGrAQGTCrGrCTCTGTGrGTAGCTGCATGGGGAATAAAAAATAATAAGCAA 

ATMTGMCAQGGrMTTTCAAAMGCAGAMGAGCTATTCA^CAAM 

TATTAGATGAAACTCTCAACTCTATQCjI I lb I ICTCTCCTGrCAATTCTGTTAAATGCTG 

TCAGCCTGTTTTCCTrATCACCCT^ 

[A,C] 

CTCrMCCC^AGGCrC^TTCTCTGCCTGGCTATCrGCGITCT 

aACATTTTCTGTGTTGCACAGGGMGGA^ 

TGAMGAATTCATTGTGATTXSQGCCACAGCACAT^ 

CCACTGCrCAGCAGCrCTGGGGGAAMTGTTTACTGAGAAGCGrACACTAG I I I I I I IGA 
CTMCCATCCTGCAACCTCCTCCCA^ 

TTAMCGMTTATTGTAGAAACAGAAAMCAMTACTGTGTTCT 
TAMCCTTGGGrAMTGQGGCATAMGATGGGMCMTAGACACTAGGGACTC(^W\AGG 
GGGGAGGGAQGGAGGAGQGCMGGGCTGGAMGCTTCCrACrGGGTAC I I IGI ICACAAC 
CrGGGTGATGGCAC<^TTAGGAGCrCAMCCCCACTATCACACAGrATACCCrTCTAACA 
AGCTGATGGKrrMCCCCTGMTCrACAATAAMTTATTTTATTT^ 

[G,A] 

GGAI I I I I AAAAAGAAGGATTCCTAGACAGGTGCAGCCAAACAA I I I I I I I IAAATGTTG 
GCAGGCCGCCACCGCCAGTCACTTATGCTGC^ 

ACrrCTCTCCAAMGAGMGCTATAGTTCAGATGGCCCTGTGCTGGGTTCT 
GTTTCrGGQGAMGGGGCrrGAGTTGCCCCGACTGGACrcrrCCTGGAGTGG 
GCITCTGATCAGACGTGAGTGAGGCAQGMCTCCGCGGTCTCCCAGCGG\GCCCAGAGTG 
(SEQ ID NO: 54)



(SEQ ID NO: 55) 



9503 CATCTCCCMCATTCCCMCCTACTTCT 

CTGTGCTGGGTTCTCCCTGGAAGTTTCTGGGGAMGGGGCrrGAGTTG^ 

TCITCCTGGAGTGGGAGCCGGGGCrrCTGATCAGACGTGAGTGAGGCAGGAACT 

TCTCC(^GCGCAGCCCAGAGTGCGGTCCCACGC^GGTCCCGGGTCCTGCGCGCrCGCGCC 

TTTGCGCTGAAGCCGTTAGGATGAGCCCrcrCCrrCCAGAGCrTTM 

[A,T] 

TTGTGTTTGGCGCCCCTGAGGAGGATGCTC 

GTGGGCAGAGATCCCGTTCGTCGGTCGCAGrrCCACCCCGCrGGGGCTCACrCAGGCCG^ 
GGAGCTGCGAGGGAGACATCCTCGATGGACTCCCTCTACGGAGATCTCriTT 
GACTATAACAAGGATGGGACCTTGGACA I 1 1 I I GAGCTTCAGGAAGGCCTGGAGGATGTA 
GGGGCCATTG^TCTCTAGAGGMGCGMGCT 

9898 ACCCCGCTGGGGCTCACTCAGGCCGCGGAGCTGCG^ 

TCTACGGAGATCTCTTTTGGTACCTGG^ I MUG 

AGCrrC^GGAAGGCCrGGAGGATCTAGGGGCCATTCMTCTCTAGAGGAA 

GTCTCACTGGGGCTGTMTCAGAGAGACGTTGGGGCTGGG^GCCCTGGAG^ 

C^GAGAGGGCAAMTTTACATGTTGTCMGCrTGACCTGGGCCCACrGCA 

[G,C] 

GTTGACCAGCGTTACCGTTTATTAAGMTMCMCACA 
TTTCTCCGTTTTCTCCTTGGCTGTAGTAAMTCTCakA 

TGGCTACATACAGCCTTGTCTTAGGAGTCACCI IGI I CAATGTGCTCACCTGTCATTAGT 

CACCCAGAGGGGCGTCTAGGCTAMGATGCGCCCTCCCCAGTTCAGAGA^ 

CACTCTACGTGTATTTGGGAGTGGGGTGGTGATTGGAMTTT^ 

10196 GTGGTTGACCAGCGTTACCGTTTATTM^ 

Al I I I I CTCCGTTTTCTCCrTGGCTGTAGTAAMTCTCCMCITCAGATT 
TGTTGGCrACATACAGCCTTGTCTTAGGAGTCACL I IGI I CAATGTGCTCACCTGTCATT 
AGTCACCCAGAGGGGCGTCTAGGCTAMGATGCGCCCTCCCCAGTTCAGAGMCT 
MTCACTCTACGTGTATTTGGGAGTGGGGTGGTGATTGGAMTTTTCTGATGTTATG^ 
[T,C] 



(SEQ ID NO: 56) 



(SEQ ID NO: 57) 
GGTTTCTGrrCCTGGMGGGG^

TGAQGTTTCCTCATAATATGCCTTMTTGATAGACCCTAGTTATCAGTACCGAGCTTAQG 
CTMCCCrrCTCrrCCCCAGAAGGCrMCCTACAGGCTCCrrcrCAGCATGrrC^ 
GTACATACTCCTATTGCAGrATTTCCAAGTCA I I I I I CATTTGGAATTTATTATTGTATA 
TMTMTTACTTTATAAGrATATTTGCTCTTTGGATGTT^ (SEQ ID NO: 58) 

GTCATGTTATTTMTGCCTGGAGGG^^ 
AGTMCTTAGCATAMTAGTAMTTCATTCAT^ 

GTTTAATGAACTGGGGATATAGTGGrGAACAACACTGACAGCGI ICI I CATTGTATTCTC 
AAMCCCTCCCrATAGrMGTAGGTCTCTGTCTGrGTGrAGGrGCATGGGGAATAAAAAA 
TMTMGCAMTMTGMCMTAAMTTATTTTATTTAAAAAAAMGAMT^ 

[C,G,A] 

TTCTCGTGTTAAGATACAAAAGCAATAAL I I I I I ATTGTGAAAATAGTCTG I I I I IGAAC 
AATATATTG 1 1 1 IGI I I I I 1 CCTGTGAMGTTGAGAMCTAAATATACGAAGAGATAATG 

GTCAGACCATAMTAAAAATAGMCTTTGACTCAAMTTTACAGC^
CCAGCCCTITATCTAAMTAMCAGACCA^

MGTCAGGTTGCTATCTCTAGAGACAATACACAMGCTATGCAATAAC^ (SEQ ID NO: 59)

TACAGGCGTGAGCCACCATGCGCCCAGCCATAGACTATATA I I I I I GATCTGATAACTGG
TTCAGCTACTMGTGACTMCAGGCAAGTAGCATCTATAGTGTGGATA^
GGACATTCACCTCCTGGGCAGGATGGCACAGM^
GAATGGTGTGCAATTTAAAACTTATGAGI IGI I IGI I I CTGGAGTTTTCCATTTAATAGT
TCAGACCATGGATTGACCGC^GGTMCTGAMCTGTGGAGAGTGAMCTCT

[G,A] 

GGACTATTGTATTGTTMGTCAGACTCATTAGGCMTO 
AMTGCTGCAGAMTATGGGTTAAAAAAMCTGTT^ 

TAACI IGI I ACTTCCAAAATGTTAGTGAAMCTGTGGCCCCAMGAGTGAMGX]AACAAA 
TGACTAAGAGAAAATL I IGI I I ICAGGATGAGAGATTAAAAMGMGCAACTTGCTGAAA 
CACTGAAAATCTCTCCACTTGTAAGATAACA^ (SEQ ID NO: 60) 

ATAGGGTCAGGGATGTCCTTTAACI IGI lACTTCCAAAATGTTAGTGAAAACTGTGGCCC 

CAAAGAGTGAAAGGAACAAA7GACTAAGAGAAAATC I IGI I I I CAGGATGACAGATTAAA 

AMGMGCAACTTGCTGAAACACTGAAMTCTCTCCACrrGTMGATMCACAAM 

CTAAMCTGGTTGGMTGMTATGGCCMOT^ 

TTACAGCCCAMTTTCCACCACATATTTTATACrMCTCCCCCCGGATTT^ 

[T,C] 

CTGTGAGGTAGCATGMGAGGTMCTATGCATGCCTMGGACITGGGAGACCT 
TCCTTCCACCMTCACCCACTMTCCCAGMTCCGCCCCCAMCCTTTTCTMTMCTAC 
CTTAMGCCAGCATAGGGAGACAGATTTGAGCTGGACTCCTGTL I ICI I GTGGGTCACCT 
TGCAATAAAAAGC I I I ICI I I I CTCMCACCTGGTATTATAGTATTGACTTCTAGTTCAT 
CGGGCAGCAAGCCCCTTTTGGTCGGTGACTATTL I IGI I GGCTGATATTTCCATTGGCCA (SEQ ID NO: 61) 

ACTAATCCCAGAATCCGCCCCCAAACCI I I I CTMTMCTACCTTAAAGCCAGCATAGGG 
AGACAGATTTGAGCTGGACTCCTGTC I ICI I GTGGGTCACCTTGCMTAAAMGCTTTTC 
TTTTCTCAACACCTGGTATTATAGTATTGACTTCTAG^ 

TGGTCGGTGACTATTC I IGI I CGCTGATATTTCCATTGGCCAAMTATAAACCTCTTAGA 
TGAAACTTCAGT ACGTAAATGGCGCCACAGAATGCTGTGACA I I I I I CTCTTGGATTATA 
[G,A] 

CAGGTTACTTTACTGMTACCGTAGGCAGTTATM 

CATAGAAAAGATACAGTAAAAATATGGTAA I I I I I I I CMCTTTTAGTTGAGATTTGGAG 
GGTATGTGCACAI I IGI I ACMGGGTATATTGCATGATGCTGAGGTTTGGGGTACAATTG 
MCCCTGTCACCCAGGTAGTGAGCATAGTACCCAATCGATAA I I I I I C AACCCTTGTCCA 
TTCCCTCCCCGI ICI I GTAGTCCCCAGTTTCTGCTTTTCCCATCXITATATCCGTGTC»\ (SEQ ID NO: 62) 

CTCAACACCTGGTATTATAGTATTGACTTCTAGTTCAT^ 

CGGTGACTATTC I I G I I CGCTGATATTTCC^TTGGCCAAMTATAMCCTCTTAGATGAA 
ACTTCAGTACGTAMTGGCGCCACAGAATGCTGTGACA I I I I I CTCTTGGATTATAGCAG 
GTTACTTTACTGMTACCGTAGGCAGTTATM 
AGAAAAGATACAGTAAAAATATGGr AA I I I I I I ICAACI I I I AGTTGAGATTTGGAGGGT
[G,A] 

TGTGCACAI I IGI I ACMGQGTATATTGCATGATGCTGAQGnTGGGGrACAATTGAACC 
CTGTCACCCAQCTAGTGAGCATACTACCCAATCGATAAI I I I I CMCCCTTGTCCATTCC 
CTCCCCGI I CI I GTAGTCCCCAGTTTCTGCrrTTCCCATCnTATATCCGTGTGCACCCC 
ATGI I I I GCrCCCATGTGTATGTGAGMCrrGrGGrGTTTGCj I I I I CTATTTCTGCGTTG 
ATTCGCTTAQGATMTGGCCTTCAGCTGC^TCCATGTTGCTGCAGAGGACGT^ 






AGGAGTTTATCMTTTTATTAGTCTTTTCAMGAACCATCTTT^ I I IGI IAATCCTC 
CCAATGGTGTG I I I ICI I I CTCATTAC I I I I I GCTCTTT ATTTCCTTCAAC I ICI I I I I I 
GCTTAATTTTAAAATAA I I I (.I I GAGATTGAGATAAGCCTCAATGATGGGTCACCGATTT 
CCAGTCI I I LI ICI I I I CTMTTATGCATTTTAMCCAGAMTCTTTCTCrMGTGTAGC 
TTTAGTTGCAGG"CACMGTTTCAGATCrGTCTCr<^GTCT 

[A,G] 

TGACCATGAAACCATCCAGTCACAATGTGGCATTA I I I I I I IAAI I I I I I I I I I I I I I I I 
TGAGATAGAGiTTTCACTCTTATTGCCT 

AGCMCCTCCACCTCCCAGGTTCAAGCGATTCriTTGCCTCAGCCTCCCM 
ATTACAGGCATGCGCCACCATGCCCAACTAATTTTGTA I I I I I AGTAGAGATGGGGGTTC 
TCCATGTTGGTCAGGTTGGTCTTGMCTCCCGACCrCAGGTGATCCGCCCACCT 




(SEQ ID NO: 63)






(SEQ ID NO: 64) 



GTGGCATTATTGGTTCATA I I I I IAI I I I I I AGACTTCCTTAATGCAAAACATATACAGT 

TGATCCTCATTATTTGGGGATTCTGTATTTGCAA^ 

CC^MGTMCCCCAAAATATATACTCACAGTAGrTTCCCAGGCATT 

GAGCAGTGAAAMCITGAGTTGCrCAGC^TGrACATTCCTAGCTA 

ACTCTGCCI ICI IGI I ICAGCTCTCATACTATTMCTAGCAAGTATCCCTTTCAAGGTCT 

[G,A] 

TTTTGTGCCAG I I I I IGCAI I I I IGTAI I I I IGI I GGTAATTTCC I I I I I AAAATGTTCC 
CCAMGGTAGTGCTGAAGTGCTGTCTAGTGTTCCTMGTCCAA 
TTATGGAGAAMTATATGCGTTGGATMGCTTTGC^ 
CAGCACACATTAMTGAGGTGCCTTCAMQ\GAMCAGACATMGACAT 

AATCAGTTG^TGAAAGTGTTGTMTCAGAGGCTCACAO^AACCTAACCCTC I I I I I COG (SEQ ID NO: 65) 

CTCACAGTACTTTCCCAGGCATTCATGGACATGCACAGAGCAGT^^ 
T(^GCATGTACATTCCTAGCTAGTAGAATAAGGCAATACTCTGCCI ICI IGI I ICAGCTC 
TCATACTATTMCTAGCAAGTATCCCTTTC^GGTCTATTTTGTGCCAGI I I I IGCATTT 
TTGTAI I I I IGI I GGTAATTTCC I I I I IAAAATGTTCCCCAAAGGTAGTGCTGAAGTGCT 
GTCTAGTGTTCCTMGTGCMGAMGCCATAGCA^ 
[T,G] 

GATMGCrTTGCCCOW^TTCMTGTTAGTGAATCM 
TTCAMCAGAMCAGACATMGACATGGTTATGTATTMTCAGTTG^^ 
ATCAGAGGCTCACAGGAACCT AACCCTG I I I I I CCTGTAGGMCAATGGTTTGGT ATTTG 
CTMTTCAGTGTTTGCAATGMTATAGAACT^ 
GMTTMCC^TATCTCTTTMGAGTGCATTTCTAMGGA 

TCAGCATGTACATTCCTAGCTAGTAGMTMGGCWACTCTGCC I ICI IGI I ICAGCTC 
TCAT ACTATTA ACTAGCAAGTATCCCTTTCAAGGTCT I I I I IGCATTT 

TTGTAI I I I IGI I GGTAATTTCC I I I I I AAMTGTTCCCCAMGGTAGTGCTGAAGTGCT 
GTCTAGTGTTCCTMGTGCMGAMGCCATAGCATGCCTTATGGAGAAMTATATG^ 
GGATMGCTTTGCCCCAMTTCMTGTTAGTGMTCMCAGCACACATTA^ 
[C,G] 

TTOW\CAOWVCAGACATMGACATG^ 

ATCAGAGGCTCACAGGAACCTAACCCTG I I I I I CCTGTAGGAACMTGGTTTGGTATTTG 
CTMTT<^GTGTTTGC4ATGMTATAGMCrTTATC 
GMTTMCOVTATCTCTTTMGAGTGCATTTCTAMGGAG^ 

CATAAI I ICI I I ACTMCAGATGCTGCCTCTCACTGTCCrrACATGGTCCAGATTCTCAT 
TCTCTCAGMTCCTGTOVTCTCCTCC^^ 

CACTMCAGTMTTTTGGTCTTCCTC I I I I I CTGGAGAAGrCAGCTGTTTATGOGCTTC 



(SEQ ID NO: 66) 



(SEQ ID NO: 67) 
AGCACCAGACCCTCTCTTAL I I Id I I I IGI I ICATTCI I I I I CATCTACACTACTCTTAG

GATTCTCATCAGCCTGTGAGCTGCrAGAAGGAMTACAGCAGTGCTT^ 

CTATTTTATTTTCTATTTTCTCrrCCrcrCTTCTG^ 



GCrCrMTTTCCCTAGT ATTAAAA ATTTTCrGTL I IIIGIIGIICIII IATCCTTGCTCC 
CTTAI I I I I ACTGCCAGA I I I I IAI I I I lATTTATTTAI I I I I GAGATGGAGTCTCACTC 
TGTCACCCAGGCTGGGGTGCAGTGGCGCGATCrCAGCrCAGXKMCCTC^ 
CTTCMGCMTTTTCOCTmAGCCTC 

ATGCCTGGCTGAI I 1 1 I OA I I I I I AGTAGAGACGGGGTTTCACCATGTTGGCCACACTG 



20492 CACTCTGTCACCCAGGCTGGGGTGCAGTGGCGCGATCTCAGCrCACTGCAACCTC^ 
CCCAGCTTCMGCAATTTTCCTCTTTTAGCCrCCCMGTAGCTGGG^ 
CCACCATGCCTGGCTGA I I I I ICTAI I I I I AGTAGAGACGGGGTTTCACCATGTTGGCCA 
CACTGCTCTCTAACTGCTGACCTCAGGTGMCC^CCCGCCTCAGCCrCCAAAAGTGCT 
GATTGCAGGTGTGAGTCACTGTGCCTGGCCTTTTACTGCCAGA I I I I I AAAAGAATAGTC 



GTGCrTTAGCTCTATTTCCTCATTTACTACTTCTCrrTM 

TGCATAGTAMTGTCrAGTAATTTATTAAAAATGTAGAAATAGGTAC I I I lAAAATGAAT 
AGATCCTACTTTMTTGAATTTATC^ 
TGCTACrrCTTMTTACATTACTTGGTMGGCCACTTCT 
ATATTATTTATCTATMGGCTGTTACMTTACrGMTTTTAAAAM 



20868 TAGTMTTTATTAAAMTGTAGAMTAGGTACTTTTAAMTGAA^ 



TGMTTTATCrrGGAGTTAGMTATGTGATTTGGATTTTAGTTCTGCr^ I l<_l IAATT 
ACATTAaTGGTAAGGCCACTTGTCAAGTCAGTCrm 

AAGGCrGTTAC^ATTACTGMTTTTAAAAMTGTGTATTTA I I I I I I AATGTA I I IGI I A 
CAI I I I I AGTATTGATGTTGGGATAGGCATTTMGC^GTCTATAACTCACCTA 



MTTTTGCCTTMTCAGTTTAMGCrrTCT^CTTAMTGA 

CTGTGGI ICI I ATCAGTTCTGAGTTTTA I I I I I IGCCCI I I I IAI I I I I I IAAAGGAAAA 
ATTGAGGCTTCAGAAATTGTCCAGTCTCrCCAGACACTOjGTCrG^ 
CAAGCAGAGTTGATTCTTCAAAGGT AAGCTCTTCATGTTGGTCAACAATTGACTTTCACT 
TTMTATCCTGCATTAGMCTCTGTGTTTGTMGTGTGGCITTAAMC^ 



20941 GAGTTAGMTATCTTGATTTGGATTTTAGTTCTGCTAL I ICI I AATTACATTACTTGGTA 
AGGCCACITGTGMGTCAGTCrCTTTGGAGGA^TATTATTTATCT 
TTACTGMTTTTAAAAMTGTGTATTTA I I I I I I AATGTA I I IGI IACAI I I I IAGTATT 
GATGTTGGGATAGGCATTTMGCMGTCTATMCTCACCTACATGCATMTT^ 
ATCAGTTTAMGCTTTCTCTTAMTGAGAGATTTGAMTTCATM I ICI I A 



CAGTTCTGAG I I I IAI I I I I IGCCCI I I I IAI I I I I I lAMGGAAAAATTGAGGCrTCAG 
AMTTGTCCAGTCTCTCC^GACACTGGGTCTGACrATTTCrGM 
TTCTTCAAAGGTMGCTCTTCATGTTGGTCAACMTTGAC^ 
TrAGMCTCTGTGTTTGTAAGTGTGGCTTTAAMCACCTC 

TCCAAGATC I I I I IGTCI I I I I I CCTCCCATTCATTTTGTATGTGTACATTTATCTAAAG 



21116 GTATTGATGTTGGGATAGGCATTTMGCAAGTCTATMCTC^ 

CCTTMTCAGTTTAMGCriTCTCTTAMTGAGAGATTrGAMT^ 
TCTTATCAGTTCTGAG I I I IAI I I I I IG CC LI I I I IAI I I I I I I AAAGGAAAAATTGAGG 
CTTCAGAMTTGTCCAGTCTCTCCAGACACTGGGTCTGACTATTTCTGM 
AGTTGATTCTTCAMGGTMGCTCTTCATGTTGGTCMCM^ 



CTGCATTAGAACTCTGTGTTTCTMGTGTGGCTTTAAMCACCTCCCTACT 
GTATATCCAAGATC I I I I IGTCI I I I I I CCTCCCATTCATTTTGTATGTGTACATTTATC 
TAMGTGTMGAATGGGAAGTGTAAGCTCAGACTGGACTC I I ICI I I CAAGGCCTCAAAG 
GATAGTGGAATGGCAGGAAGTAAGG I I I I AACTCCATAGATGAGGAGCTGAAGAG I I I IG 
GTGTTGCTTTTTCTCCATTTGATTTC 



(SEQ ID NO: 68) 



(SEQ ID NO: 69) 



(SEQ ID NO: 70) 



(SEQ ID NO: 71) 



(SEQ ID NO: 72) 
CATTGATTCAAACTMGMGACTAGCAGAT^

GAAAAAAQGGAMTTACTMGCTCrCCAAGCTMCAMGAMTACCrGTTTAMC^ 
GAAAACA(W\ATGCAMTTTGMCCTTATTGTCTGGGGCM 

CAGACTTTTATACTCTTAATG I I I IGI I I CATQQGATAGAGCAGTAATCTCTGCAGCCCA 
GGTGCTCTCAMTACTCTGTTGCTATAMCACAQGGCAGGMCT 1 1 I I I I ATGATAAC 
[G,A] 

TAAMCAG7W\AGGACAATTATATTGTATTM 

ATTGTCTAAAAATCrrTCrAAATGGC I I IGI I ATTGMTTTATCTCATTTTATATCTGTG 
CCAACAGCATTTTCATCCTTTCrCTTCATAAl I ICI I I I ACAAACAGCTGCrCAAGAGGA 
A&XCTCAAAGTCrCAAGGCTGAGCACGTAATGAC I I I IGI I AGTACTAGATGAGAAGGGC 
TTTCCTGAGGAMTGAAMCCrAAMCATGAAAAGAAGATAAACAGMTTT^ 

AMCTAAGMGACTAGCAGATTCATCACATTATT^ 

GAMTTACTMGXXCTCCMGCTMCAAAGA^ 

MTGCAMTTTGAAOTTATTGTCrGGGGCMTC^GTTTGA 

ATACTCTTAATG I I I IGI I ICATGGGATAGAGCAGTAATCTCTGCAGCCCAGGTGCTCrC 
AMTACrCTGTTGCrATAAACACAGGGCAGGAACTGA I I I I I I ATGATAACGTAAAACAG 
[A,-] 

AMGGACAATTATATTGTATTMTATTGTT^ 

AAAT CnTCTAAATGGC-l I IGI I ATTGMTTTATCTG^TTTTATATCTGTGCCAACAGCA 
TTTTCATCCrrTCrCTTCATAA I I ICI I I I ACAMCAGCTGCTCAAGAGGAAGGCTCAAA 
GTCTCAAGGCTGAGCACGTAATGAGI I I IGI I ACTACTAGATGAGMGGGCTTTCCTGAG 
GAMTGAAMCCTAAMCATCAAAAGMGATAMCAGMTTTGGA 

G^AAATGCAMTTTGAAanTATTGTCTCGG 

TTTTATACTGTAATG I I I IGI I I C^TGGGATAGAGCAGTAATCrCTGG\GCCG\GGTGC 
TCTCAMTACTCTGTTGXTATAMCACAGXiGCAGGAACrGA I I I I I I ATGATAACGTAAA 
ACAGAAAAG GACAA TTATATTCTATTAATATTGTTGTGMTA^ 
TCTAAAAATCTTTCTAAATGGLI I IGI I ATTGAATTTATCTCATTTTATATCTGTGCCAA 
[C,T] 

AGCATTTTCATCCTTTCTCTTCATAA I I ICI I I I ACAAACAGCTGCTCAAGAGGAAGGCT 
CAAAGTCTCAAGGCTGAGCACGTAATGAC I I I IGI I AGTACTAGATGAGAAGGGCTTTCC 
TGAGGAAATGAAMCCTAAAACATGAAAAGAAGATAMCAGAATTTGX^ 
AGAGCATATMTATTCTGCrrCTAMGTMTATTCTTCTAGGA^ 
TQGCTCTTAGGCCAGAAATCATATTCCTATA I I I ICI I I GATAGCTTTAGGAATAATGCA 

T GAACC TTATTGTCTGGGGCAATCAGTTTGACrATTTM 

TGI I I IGI I I CATGGGATAGAGCAGTMTCTCTGCAGCCCAGGTGCrCrCAAATACTCTG 
TTGCTATAMCACAGQGCAGGMCTGATTTTTTATG^ 

TTATATTGTATTAATA I IGI I GTGAATATTTTCAGTCCrCAC^TTGTCTAAAMTCTTTC 

TAMTGGCTTTGTTATTGMTTTATCTCATTTTATATCT 

[-,T] 

TTCTCTTCATMTTTCTTTTACAMCAGCT 

TGAGCACGTAATGAC I I I IGI I AGTACTAGATGAGMGGGCTTTCCTGAGGAMTGAAAA 
CCTAAMCATGAAMGMGATAMCAGAATTTGGACAGrGAGATATAG^ 
TCTGCTTCTAAAGTM TATTCT TCTAGGAMGTGAGGGCGT^ 
GAAATCATATTCCTATA I I I ICI I I GATAGCTTTAGGAATMTGCAMTTCTAAGCCCAA 

GMCCTTATTGTCTGGGGCMTCAGTTT^ 

Gl I I IGI I I CATQGGATAGAGCAGTMTCTCTGCAGCC(^GGTXjCTCT 
TGCTATAMCACAGGGC^GGAACTX^TTTTTTATG^ 
TATATTCTATTMTATTGTTGTGMTATTTTCAGTCCrCA 
AAATGGCI I IGI I ATTGMTTTATCTCATTTTATATCTCT 
[-,C,T] 

TCTCTTCATAAI I ICI I I I ACAMCAGXTGXITCAAGAGGMGXKTCAMGTCTCAAGGCT 
GAGCACGTAATGACI I I IGI I AGTACT AGATGAGAAGGGCTrTCCTGAGGAAATGAAAAC 
CTAAMCATGAAMGMGATAMCAGMTTTGGACAGTGAGATATAG^ 
CTGCrrCTAMGTMTATTCTTCTAGGAAAGTGAGGGCGTTTCCCT 






(SEQ ID NO: 73) 



(SEQ ID NO: 74) 



(SEQ ID NO: 75) 



(SEQ ID NO: 76) 
AAATCATATTCCTATA I I I I CI I I GATAGOTTAGGAATAATGCAAATTCrAAGCCCAAG (SEQ ID NO: 77)

ACCTT ATTGTCTGGGGCAATCAGTTTGACTATTTMCT AATGT 
I I IOI I ICATQGGATAGAGCACTMTCTCTGCAGCCCAGGrGCTCTCAMTACrCrGTTG 
CTATAAACACAGGGCAGGAACTGA I I I I I I ATGATAACGTAAAACAGAAAAGGACAATTA 
TATTGTATTMTATTGTTGTGMTATmCAGTCCTCACATTCT 
ATGGCI I IGI I ATTGMTTTATCTCATmATATCTGTGC^CAGCATTTTCATCCrrT 

[-,C]

TCTTCATAAI I I LI I I I ACAMCAGCTGCTCMGAGGMGGCTCAMGTCTCAAGGCTGA 
GCACGTAATGAC I 1 1 lb 1 1 AGTACTAGATGAGAAQGGCTTTCCTGAGGAAATGAAAACCT 
AAMCATGAAMGMGATAMCAGMTTTGGACAGTGAGATATAGAGCATATMTA^ 
GCTTCTAMGTMTATTCTTCTAGGAMGTGAGGGCGTTTCCCTGGCT 

ATCATATTCCTATA I I 1 1 C_ I I I GATAGCTTTAGGAATMTGCAAATTCTAAGCCCAAGCT (SEQ ID NO: 78) 



ATATTTTCAGTCCTCACATTGTCTAAAMTGTTCTAAATGGL I I lb I I ATTGAATTTAT 
CTCATTTTATATCTGTGCCMCAGCATTTTC^TCCTTTCTC^ I I I CI I I IACAA 
ACAGCTGCTCAAGAGGAAGGCTCAMGTCTC^GGCTGAGCACGTMTGAC I I I IGI I AG 
TACTAGATGAGAAGGGCTTTCCTGAGGAM^ 

CAGMTTTGGACAGTGAGATATAGAGCATATMTATrCrGCTTCTAMGTMTATrcrrC 
[C,A,T]

AGGAMGTGAGGGCGTTTCCCrGGCrGTTAGGCCAGAAATCATATTCCTATA I I I ICI I I 

GATAGCTTTAGGMTMTGCAMTTCTMGCCCMGCTTCAGAA 

AGCTTAGCTGCCATGACAAMTACCATAGGCTGGATXKATTAM 

TTTCACAGGTCTGGGAGCTGGGMGWMGATGAGAGTGCCAGC^ 

TGAGGGCTCTCXTTCTGGCrrGCAGATAGAC 

CATTGTCrAAAAATCTTTCTAAATGGC I I IGI I ATTGMTTTATCTCATTTTATATCJGT 

GCCAACAGCATTTTCATCCTTTCrCTTCATAA I I ICI I I I ACAAACAGCTGCTCAAGAGG 

MGGCTCAMGTCTCAAGGCTGAGCACGTAATGACI I I IGI I AGTACTAGATGAGAAGGG 

CrTTCCTGAGGAMTGAAMCCTAAMCATGAAMGMGATAAACAGAATTTGGAC^ 

AGATATAGAGCATATMTATTCTGCTrCT^ 

[G,T] 

TTCCCTGGCTGTTAGGCCAGAMTCATATTCCTATA I I I ICI I I GATAGCTTTAGGAATA 
ATGCAMTTCTMGCCCMGCTTCAGMTAGACTMGMGTATTAGCT^ 
OW^ATACCATAGGCTGGATGCATTAMCAATGGAMTTTAG I I I I I CACAGGTCTGGGA 
GCTGGGMGTTTAAGATGAGAGTGCCAGCATGGrrTGGGTTOT 
GGCrrcCAGATAGACCCCTTCTCACTGTATTGTCATATGGC^ 

GAMGTGAGGGCGTTTCCCTGGCrGTTAGGCCAGAMTCATATTCCTATA I I I ICI I I GA 

TAGCTTTAGGAATMTGCAMTTCTAAGCCCMGCrrCAGMTAGACrM 

CTTAGCTGCCATGACAAMTACCATAGGCTGGATGCA^ 

TCACAGGTCTGGGAGCrGGGMGTTTAAGATGAGAGTGCCAGCATGGT^^ 

AGGGCTCTCTTTCTGGCrreCAGATAGACCCCTTCTCACTGTA^ 

[-,A,G] 

AGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGGGATCTTTCTCT 

CCATAGTCCTGTTGGATONGGGTTCCAT^ 

GATGCTATCTCCAGATATMTCACACG^ 

GGACA<^GCTCAGTCCATAGCAAAGGATMTGCAGAGGGTTGGA^ 

CACAAI I I I I MTATAAATATTTTATGGTAAC I I I I I I I I I I I 1 1 I GAGATGGAGTCTAG 

AT(XrTCrCrTGCrrTCrATTATMGGC<^TAGTCCT 

TGACTmTTTGACTTTACCCCCCT^ 

TAGGGCCTCM<^TTTGGATTTGGGAGGGA 

AGAGGGTTGGATATTTAAAAGTAGCTACACAA I I I I I AATATAAATATTTTATGGTAACT 
I 1 1 1 1 1 1 1 1 1 1 1 I GAGATGGAGTCTAGCTCrGTTGCCCAGGCrGGAGCGCMTGGTGCGA 
[A,G,T]

CTCAGCTCACnKAACCTCCGCCrCCCAGGTTCAAGCAATTCTCCTGCCT 
AGTAGTTGGGACTATAGGCACGCGCCACCACGCCTGGCrA I I I I I I I I I IAI I I I I ACTA 



(SEQ ID NO: 79) 



(SEQ ID NO: 80) 



(SEQ ID NO: 81) 
GAGACGGGmGCACCATATTGGTCAGGOTGTCrCGMCTCCTGACATCAGGTGATCCA
CCCATCrrGGCCTCCCAAAOTGCTGGGATTACAGMGrGAGCCACCGCGCCTAGCCAGCA 
GOTTACTGAGATGTMTTCACATGCCATAMTTCACriTTCrAMGTATACM (SEQ ID NO: 82) 

ATATMTCA(^CGGTGGGrrAGGGCCTCMCATTTGGATTTGGGAGGGACACAGCT 
CCATAGCAMCKSATMTGCAGAGGGrrGGATATTTAAAACTAGCTACACAA I I I I IAATA 
TAMTATTTTATGGTAAC I I I I I I I I I I I I I I GAGATGGAGTCTAGCTCTGTTGCCCAGG 
CTGGAGCGCMTGGTGCGATCrCAGCTCACrG(^CCTCCGCCTCCCAGGrrCMGCAAT 
TCrCCTGCCrCAGCCTCCTGAGrAGrrGGGACrATAGGCAC(K:GCCACCACGCCrGGCrA 
[-,T] 

I I I I I I I I IAI I I I I ACrAGAGACGGGTTTGCACCATATTQGTCAGGCrTGTCTCGAACr 
CCTGACATCAGGTGATCCACCCATCTTGGCCJCCCAMGTGCrGGGATTACAGAAGrGAG 
CCACCGCGCCTAGCCAGCAGCnTACrGAGATCTMTTCACATGC(^TAMTTCACrm 
CTAMGTATACMTTCAGTGACrTAAMCATTTATTTA I I I I I AAATTGACAGAATTACA 
TGTATTTATCATGTACMG^TGT^TG^ (SEQ ID NO: 83) 

TTCTCTTACTAI I I I I CMGAATATMTATATTATTATTAATTGTAGTCTTCATGTTGTA 
TAGTGG7VGTTCTTG7\ACTTATTCCrCA^ 

ACCATACCCGACTCCGWVGTATTCTGCrcrCTGCTTCTATGAGATTAAC I I I I ICTGAT 
TCCAGVTGAGTGAGATCATGCAGTATTTATTTGTCTTTACCTGGCrTA^ 
TGTTACAGATAACAGGATTTCC I I CI I I I I I I AATGGCCGAATAG I I I I CTATTGTATAT 
[A,G] 

TATAGOVCATTTTCTCTCTTCATGCATTQGTGGACACT^ 
TATCCTGMTAGTGCTATMTGMCATGGGMTGCACATGGCrCrrTGAC^ 
CATTTTATATATGTGTATATATATATGTATACACACACATACA 
AQGATCATATGGTAGTTCTATATTTAA I I I I I AMGGAACTCCATACTGCTTTCCATAAT 
GGCTGTATTAGTTTMCTCCTCACCAAC^QGGTGCAAMGTTCCC I I I I CTCTACATACT (SEQ ID NO: 84) 

I I IGI ICTAGAGTATAGTTTAAGTCTGATGI I I CI I ACTGATTTTCTGTTGAGATGATTT 
GTCTATTGCTGMGGTAGGGTGTTG^GTCCCCTACTATTGCrG^^ 
TCCTTTCAGACGTATTAATGG I I I I I ATTTTATTTTA I I IGI IGI IGI IGI IGI IGI IGT 
TGI IGI I I I I GAGACGGAGTCrCACrCTGrCACCAGGCTGX^GTXXj^GTGGCAGGOT 
GGCTG^CTGCAGCCCCCCTCT(^CGGTT^ 

[G,A] 

CTX5GGACTACAGGCGCATGCCACCACGCCCAGCTAA I I I I IGI Al I I I I AGTAAAGACGG 
GGTTTCACCATGTTGXKXAGGTVTGX^^ 
GCCTCCCAAAGTGCTGGGATTACAGGTGTGAGC^ 
TATCTTTAGGTGXTCTGATGTTOKITC^^ 

CTTATTMGQGATATGC^TATAAMTATATAMTTCT (SEQ ID NO: 85) 

TCTGATG I I ICI lACTGATTTTCTGTTG^GATGATTTGTCrATTGCT 

TG MGTCCC CTACrATTGCTGTATTGCAGTCTCrCTCTCCm 

TTTATTTTATTTTA I I IGI IGI IGI IGI IGI IGI IGI IGI IGI I I I I GAGACGGAGTCTC 

ACTCTGTCAC<^CX5CTX^GrGCAGTGGCAGGGTCrCGGCT 

CGGTT<^GCGATTCTCCTTGC(^^ 

[T,C] 

CACGCCCAGCTAAI I I I IGIAI I 1 1 I AGTAMGACGGGGTTTCACCATGTTGGCCAGGAT 
GGTCTTGATCTCrrGACITCATGATCCACCCGCCXrGGCCTC^ 
AGXTrGTGT^GCG^CCACCCCTGGCCMT^ 
GGTTCATATATATTTATAAAAMC^TAGXTACATMCTTA 

AMTATATAMTTGTGACACTGWW\TTTAAM (SEQ ID NO: 86) 

AGTXKTTXXX3ATTACAGGTGrGAGCCACCACCCCTGGCCMTC 
GTGCTCTGATGTTQGGTTCArATATATTTATAAA 
GGATATGCAATATAAMTATATAMTTGTGACACrGAAMTTTA 
GrAAAAGTACCTTCATATMCrrACTATTATATCCTCT^ 

TTATATAGGAAC I I IGI 1 1 CTCCTTTACMCITCTGACTTAAAG I I IGI I I I ATATGATA 
[T,C] 

MGTAMGrrACTCCrGCrCTCCriTGGTTTCTGTTTCCATGGAATATC I I I I ICCATTC

CTTCACCATCAGTCreTGTGTAI I I I I AGVGATGAMTGAGTCrGTGVTGQGCAGCATAT

AGTTGGATCTAGI I I I I I I AATCCACTCAGACACTGTG I I I I I IGATTGGATAATTTAAT

CCATTCATGTTCMGGTMTTATTGATMGT^

GTTTCATGGI ILI I I I ATAGATCCnTATTL I I I ICI ICCTCTCTTGCTGTU I I I I I I I (SEQ ID NO: 87)

26060 GGTTTTTGGTTTCTGGrrTACC^

ATTTTAACTTGATAACTTAA I I I I I ATTGC^AAAACCCCCCAAAACAAAAAAATCTACAC

TTTTACTTMTCCCCTGAAATTTTGAA I I I I I GATGTCACAGlTTACCrCTTTTCATATT 
GTGTATCCCTTAMTTATTGTAGCTATTATTACTTTTM _ 
AGATGTMGTGATTTGCATACCATCATTAG^^ 

[C,T] 

TTTTATCAGCCAGriTTTATACTTTCAGATG I I I I I GTGTTACTCATTAGCATC I I I I ICT 
TTCAGCTTGAGGAGCTCCrTTTACGI I I CI I ATAAAATAGGTGCGGTCATGATTATCTCC 
CTCAGCTATTGrrTGTCTQGGAMGTATCTCTCCrrCATTTCTCM 
GTACATTACCCrTGGTTQGTA I I I I ICTCCTrGMCGCTTTAMTATATCATCCCTTTCT 
CrCCTGACCTGTTAGGTCTCTGCTGACCAGTCTGTTTCCAACCATATTGGGACT (SEQ ID NO: 88) 

30245 ATTTTMCCATCC^TTGTITCrGCTTCTCrAGATMCCCTGACrM 

GAAGTGATATCTCATGGCnTGATTTATA I I ILI I I CATQGCTAGTGAL I I I I 1 1 IGTAC 
TTTTGGGATATTGTTATTATTATTATTATTATTACT I I CI ICAGTAAAA 

GTGTTAGAAACAAI I I I I AMQGCAGMTGrGACCAGAGTTTCCTGTAGTTATATAACCA 
TCATGGACCTTCCCTCMGTGCTMGCCATTACT 
[C,G] 

TTGI I I I CI I CCATTTCACTGrCrCTTTGrGrCCCAMCTTGMTTCATGQGAAAM 
CrCWVTGGTGCrrMTATGGmGGATATTTGrCCCCTCCAMTCrCATGTrGAMTATG 
ACCrCCAGT&TTGGMGTAGGGACTACrrGGCTCACGAGAGTGGATCaTCATTM 
TTGGTMTMGTGAACrCTATTAGTTCATGAAAGCTGb I IGI I GATAAGAGCCTGGCATC 
TCATTTCrCTrGTCCrrCTCrCACCATCTGACACACTTGCrCACC I I I I I I CI ICAGCCA (SEQ ID NO: 89) 

33664 TTCCAGACTCTAGMGTACACTGTCCTATCCTr^^ 

CAGACACTATATGAMC^GGGAMTTAGAGGCCMGATACCrATGACrTATA^ 

TTTAMGAAMTATTAGCAMCTGMTCAGCCATTTTAAAAMTATACC^ 

ATTCATMGAGCAGCTTAACAAAAI I IGI I AGAAGGCATTAAAGAAGACTCAGTATAGAA 

AAGATCTACCTrCTCTCCAMTTGGTGAT^^ 

[G,T] 

GTTTTTTTGAGGAACTTGTCAAGCreAGTCrCA^ 
GAATATCCAGGACATTCCTGAAGAACTGTMG^ 

AAGGGI IGI I ATTAAGCCATMCCMGTCAGTGCrGrrTTCTACAGAMCAGACM 
CAAGTGAMCATMTAGAGAGCCCAGAMCAGACCCATCCATATTTTG^ 

GAAAGAAGTAGCrnTGCAAAACTTTOGGAAAAGGAG^ (SEQ ID NO: 90) 

33883 TAMGMOOX^GTATAGAAAAGATGW 

TGCCATTAAAAAAACCCACCTGG I I I I I I IGAGGMCTTGTCM&CTGAGTCTCAAATTT 

ATATCAMGAGCAMGGCCrAAGMTATCCAGGACATrCCTGAAGM 

GGGGCCTGCCCT ATCAGATACCAAGGG I IGI I ATTAAGCCATAACCAAGTCAGTGCTGTT 

TCTACAGAAACAGACAAGTTMCMGTGAMCATMTAG^ 

[C,A] 

CATATTTTGGATTTGTCACGTGAAAGAAGTAGCriTG 
GTGTGC^TAGATGATGCTCGTGCT^^ 
TTACCGTACACAMCACOWICTAMCGTGAM^ 
GGGAAGAMTATCTTTATCTCAGTGTAGGGMGMTTTATTTTAAAAAG7\A 

GGCC^TACATAGGMTGAAMGATTGMTTCAGCTGCAT^ (SEQ ID NO: 91) 

34373 TATCTITATCrCAGTGTAGGGAAGAATTTATTTTAAAMGMGAC^ 
TAGGMTGAAAAGATTGMTTCAGCTGCATTAAA^ 
CAAGAGCATCTGTACTTGGACAGCATAGAGTGGAAAGACA 

TTATMCrrGMGGATTAGMTGMTGATATAAAGAACT ATGT AAATAAGAAAAAGACAT 
"CACGTGAAGGAAACAGC



ACAACCGGTT AGAAAMCGGGCAAAGACATGAACAGCATA" 
[G,A] 

GTAGCAMTGMCATGGTMGAGATGCTCMCACGTTT^^ 

G TTATAC CCACAGCAAGACTATCTTATCTAGGMGT^ 

GGTTTTMGCTACAGAGTTTGTMTTCATTTATT^ 

ACTGTTTTAGAMCCTTGGTTATMCTTTGMT^ 

GAQGATGCTTATGTXnXHQGAGrraQGTGGrGGGGrCAM 

ACTTGAAGGATTAGAATGMTIGATATAM 

CGGTTAGAAAMCGGGCAMGACATGMCAGCATATT^ 

OWVTGMCATQGTAAGAGATCCTCMC^CGriTAGrMTrr 

TACCCACAGCAAGACTATCTTATCTAGGMGTTT^ 

TTMGCTACAGAGTTTGTMTTCATTTATTTAT^^ 

[G,T] 

TTTAGAMCCrTQGTTATMCTTTCMTGAMTTAAAAAAMTCCTTGCCT 
TGCTTATGTGTGGGGAGTTGGGTGGTCGGGTC^^ 

AGTGACATAMTAAACCTATAMTATTGCMCCCAGAGTTATATTATAAATGTMGTA^ 
GACTAGGACTCTCATGCAGATATACCTCTGT^^ 

TTCCG\TATGG\AGTGW\ATAAAMGTGACACrAGAAMCAG\ATMTGAATATCTGAA 



GGCATTTAAGT ATTCrGCCATAGGGAAGTGTAAAAGrrTGTAGGC I I I IACI I I I IATAQG 
TACrATATTGTCGWVTMTCrCAGCACCrCATGGTTGCTAAGGATCrGTGTCL I IGI I I 
GGTCAGATTATGTTTATCTCTGGC^^ 

ATI I I I I I GCTTCATCTGCTTAGCATTTCATACCACj I I lb I I I I CCACCAAAGTTCAAA 
TTTTGATTGTTTGUTMTATTCrGCATACrGATGTAMCCMGTT 

[T,A] 

CTGCrCCTGAAACCCTTAQGAACrCTCrGAAGGAG I I I IATTTAI I I I I IGI I I I IGI I I 
TTGI I I I l(j| I I I I I I I I GAGACGGAGTCTTGCTCrGTTGCCCAGGCTAGAGTGCAG 
TGGTGCGATCTCGGCTCTCrGCAMCTCGGCCTCCGGGGTTCACGCCATTCTCCT 
AGCCACCGGAGTAGCTGGGACTACAGGCACCCACC^CrGCGCCrGGCTAA I I I I I I I IGT 
ANN I AGTAGAGACGGGGTTTCACCGTGTT AGCCAGGATGGTCrCGATCTCCTGACCTT 

TTGAGACGGAGTCTTGCTCTGTTGCCCAGGCTAGAGTGCAGTGGTGCGATCTCGGCT 
TGCAMCTCGGCCrCCGGGGTTCACGCCATTCrCCTGCCTCAGCCACCGGAG^^ 
ACTACAGGCACCCACCACTGCGCCTGGCTAA I I I I I I I IGTAI I I I I AGTAGAGACGGGG 
TTTCACCGTGTTAGCCAGGATGGTCTCGAT^ 

TCCCAMGTGCrGGGATTACAQGCGTGAGCCACTGTGCCCGGO L I I I I I I I I I I I I I I I I 
[T,-,C] 

TTTATQGGCTTGTCrTCTACACITCAGATTTGACrAM 

CAGGAGTTCA^TTGCCACTAGTMCAA^ 

TTGGTGATTAGTGCCTTCTCAGCTATGACT 

GCCrAGATAAATTGT ACACTATGTCAAGTTTTATTTACATMTTCTTACGGT A I I I 1 1 I A 
AGCTAGTTGATMCAGITGAGACTACM 

GMTTOTAAAAATATTATTATAGMTTGTTTCTCTCAAA 

TGAAGGGGTGATGATTTGAMCMTACCrCTCCATTAGCrAMTTTTATAT^ 

TGCATGTTTTAMTGATMGTCAGATTTATAAAAATATTT^ 

GTTTAGGGGTATTCACATACAGTTTTAA I I I I I ATTTACATATTTAAAACATATCATGGT 

ATAMTATCATGTCGATATAMTTTGAGATAM 

[T,G] 

MTTTCTTAAMGATGTCATCACCAGTTGGTTT^ 
AAAAMGATTGACTATGATAAAATGCTGCCCT 

rGTAA 



TTCATTTTAACCT AGACCAAGAGAAAAC 
ATACTGTGMTCTATGATGMTGAMGAMGTTGTMCTGTrGGTTTTGTATATT 



TTACTGTTTATTTTCA I 1 1 <_ I I GTGAACTGATACTGTAL I I lb I I CATTGTGAGT AGACA 
ACTTATMTCTATGTACTCAA^TTGGTTTAGTATAMTTCT^ 

TGTTATACTTATGGTCAACAL I I I I I ATATTTGTCTGTAGATTTCTGTAOWW\GATTC 
TGACACTG I I I I MGCCAGCATTCCTTCAGMTGTACCCAMTCTCAAMTTTATTTAGG 
GGCAAAGCTAATGCTTTAAAGAAAAAGGAGA
[A,G] 

GGGATTGGTGTGTG I I I I I CI I I AGGAACAGTAGTMCTTGAC I I I I AGAGAACTTGAAT 
AAGCATTTATTTTTTCCTTTGTCCTATTTTATTOT 
GGATTTCTCTGGAATTTAGTTTCT^ 
TTlGATACTTCTCTAGAAAGACrCACATMCTCACTCAM 

Tn"ATTACGGGGAAAAGATGCGGATGAAMTCAGTCAAGTAMGMGCA (SEQ ID NO: 97) 

TTATATCATTCTGCTTTTA I I I I I AGGTTCACGGTTCAAAATCAGACAAAATGAACATAT 
TTGGTGGCTTTCGACAGATGGTAAAAGMGGA 
GTACAMCGTCATCAAMTTGCTCCTGAGAC^^ 
TMTTGTTATCACCC GTCGM TTTATTM 

TGTTAATGTATAATGC I I I IGGGATTCI IGI I I lAATACATGATAATCTTTCACATATAC 
[T,C] 

CCATMGGAGGATCACrrATAGGAGATTAGACTAMTAAMTCAGAGATTTCT 
AAGTTATGGGATTCTTMTTCATCATATTATTTATAAAG I I I I I I I I I I CTAAGTAGTTC 
TTAAAGG MGGGT AGMTTTTAGTTTATTCATTCr 
MCATMGTTTTATGAMGTGTCACMTCTMCCTCTGGM 

TCCTTTGTGTMTTTGACGTTGCrGTAAMTTGACKTGAGTTTGGAGTG^ (SEQ ID NO: 98) 

AMTTGCTCCTGAGACAGCTGTTAM 
GTGGMTTTATTMCAMGAGGAGTTAGTAMCGG^ 

CTTTTGGGATTCI IGI I I I AATACATGATAATCTTTCACATATACCCCATAAGGAGGATC 
ACTTATAGGAGATTAGACTAMTA AMTCAGAGA TT^ 

TTAATTCATCATATTATTTATAAAG I I 1 1 I I I I I I CTAAGTAG I I LI I AAAGGAAGGGTA 
[G,T] 

MTTTTAGTTTATTCATTCTGAATCCTGAGCAGAAGCAGCACA I I I IATG 

AMGTGTCACMTCTMCCTCTGGAAGGAAM 
GACGTTGXTGTAAMTTGAGXXGAGTTTGGACT 
CTTCTTCCCCATGTACTCCAGCAC^^ 

GTGTTGAATGAGTCAATGMTGMCAMTGCATTTACCTCTGM (SEQ ID NO: 99) 

TGGGATTLI IGI I I I AATACATGATMTCTTTCACATATACCCCATMGGAGX^ATCACTT 
ATAGGAGATTAGVVCTAAATAAAATCAGAGT^TTTCTGVTGACCAA 
TTCATCATATTATTTATAAAG I I I I I I 1 1 I I CTAAGTAG I I LI I AAAGGAAGGGTAGAAT 
TTTAGTTTATTCATTCTGMTCCTGAGCAGMGCAGCAC^ I I IATGAAA 

GTGTCACMTCTMCCTCTGGAAGGAAAACT ATMGTTGMGTCCrrTGTGTMTTTGAC 
[G,A] 

TTGCTGTAAMTTGAGCTGAGTTTGGAGTGAC^^ 
TTCCCCATGTACTCCAGCACCTAGACAGAGCT^ 
TGMTGAGTCMTGMTGMCAMTGCATTTACCTCTGMTCACTTCTCT 
GTTMCTTGGATTATTTGAGCrATTGCrrCAGCCrMCTCMTGTAM 

AGGTMGTTTTAGAGTTTGGGTTCTCrTTATGGTCATTAGCAGMCT (SEQ ID NO: 100) 

C\TATACCCCATAAGGAGGATO\CTTATAGGAGATT^ 

TCATGACCAAGTT ATGGGATTCTrAATTCATCATATTATTTATAAAG I I I I I 1 1 I I ICTA 

AGTAGTTCTTAAAGGAAGGGT AGMTTTTAGTTTATTCATTCTGMTCCTGAGCAGAAGC 

AGCACACTMCATMGTTTTATGAAAGTGTCACAATCTMCCT 

AGTTGAAGTCCTTTGTGTMTTTGACGTTGCTGTAAMTTGA 

[C,G] 

CTCCATGAAGGCAGGGGCGTGGLI I LI I CCCCATGTACTCCAGCACCTAGACAGAGCTTG 
GCATGTGATMGTTTCAAGCGAGTGTTGAATO 

TCTGAATCACTTCTCTGTCGGL I I I IGI I MCTTGGATTATTTGAGCTATTGCrrCAGCC 
TMCTO\ATGTAAAGGGGAAATACAGAGGTAAG I I I I AGAGTTTGGGTTCTCTTTATGGT 
CATTAGCAGMCTGTCTAGTTGAGCAGCCACAGATTATGI I I I CCATTATTTATTCCATC (SEQ ID NO: 101) 

ATMGGAGGATCACTTATAGGAGATTAGACTAMTAAMTCAGAGATTTCT 
GTTATGGGATTaTAATTCATCATATTATTTATAAAG I I I I I I I I I I CTAAGTAG I ILI I 
AMGGMGGCTAGMTTTTAGTT^
CATMGrrTTATGAAAGTGTCACAATCTMCCT^ 
CTITGTGrMTTTGACGTTGCTGTAAMTTGAGCrGAGTTT^^ 
[G,C]

GCAGGGGGCTGGCI I CI I CCCCATGTACTCCAGCACCTAGACAGAGCTTQGCATGTGATA 
AGTTTCAAGCGAGTGTTGMTGAGTCMTGMT^^ 

TTCTCTGTCGGL I I I lb I I MCTrGGATTATTTGAGCTATTGCrrCAGCCTAACrCAATG 

TAMGGQGAAATACAGAGGTMGTrm^GTTTQQOTTCTCm 

ACTGTCTAGrrGAGCAGCCACAGATTATCTTTTCCATTAm 

GCACCTAGACAGAGCrrGGCATGTGATMGTTTCMGCGAGTGTrGAATGACTCMT^ 

TGMCAMTGCATTTACCrCTGAATCACTTCTCTGTCGGL I I I lb 1 1 AACTTGGATTATT 

TGAGCTATrGCrrCAGCCTMCrCMTGTAMGGGGAAATACAGAGGTAAG I I I IAGAGT 

TTGGGTTCTCrTTATGGT^TTAGCAGM 

TCCATTATTTATTCCATCATTGTTTA^ 

[C,-] 

CCCCCATAGI I I I I GTATTATTCCATGrAGATTTTAGATTATTCTGGAGAGTG I I I IGI I 
CTTGAGCAAC^GAATACTCTTGAGMGATTACGAAGTCCAGTGGrATCL I I I I LI I IGCC 



(SEQ ID NO: 102)



48859 



49126 



49378 



ATTTTMTTAGMCCTATCCTrGGGAtt 

AMTCATGATTATTMGGGCTMTGrrTGAGATACCCrrAGGTTATTCT 

CATTTACCTCTGAATCACTTCTCTGTCGGC I 1 1 IGI I AACTTGGATTATTTGAGCTATTG 
GTCAGCCrMCrCMTGTAMGGGGAMTACAGAGGTMGTTTT^ 
TTTATGGTCATTAGCAGMCTGTCrAGTTGAGC^ 
ATTCCA TCATTGTTTATCAAGGTVCTGTMG^ 

I I I I I GTATTATTCCATGTAGATTTTAGATTATTCTGGAGAGTG I I I IGI I <_ 1 IGAGCAA 
[G,C]

AGMTACTCTTGAGAAGATTACGAAGTCCAGTGGTATCL I I I I LI I I GCC TAGGAA ATAG 

GMCCTATCCTrcGGAAGGCTATTTTCCTTATATGAAG 

TATTAAGGXKTAATGTTTGAGATACCCITAGGTTATTCTGACC^ 

GATAGGAMGXZCACAGCCTAAMTAMTAMTACTCM 



(SEQ ID NO: 103) 



(SEQ ID NO: 104) 



GATTATTCTGGAGAGTG I I I IGI I LI IGAGCMCAGMTACTCTTGAGAAGATTACGAAG 

TCCAGTGGTATCL I I I ILI I IGCCTAGGAAATAGAGMGCAAAAAAAAAAAAAAAAAAAA 

ATTAMGAAMTCTAGTCrCCAGG^TTTTMTTAGAACCrATCCrTGGGA^ 

CCTTATATGMGGTTTGMGATTCAMTCATGATTATTMGGGCrMTGTTr 

CrrAGXJrrATTCTGACCACATACrrGX^TTTTATGATAGGAM 

[A,G] 

TAAATACTCMTGCAGTTATTTCAGTATGCAAGAAGTTTGGTA I I 1 1 I GAAAAAGTCCAT 

GGGTATTGCMGCAMTATGCACATTTTGCT^ 

ATACCACCMCAGGCATCCTCTGCTTCTGTCCACCCAAGCTCCr^ 

TAGTATTGTGATTTCTGCACACTAAL I I ILI I AGACATQ\AGAGAAAGCTGTCTACaCAG 

TGTGGTGTAG I I I ILI I ATGGGCTCTGGACCTATGGTGCTG 1 1 I I CTCTCCTCCTGCTGA (SEQ ID NO: 105) 



TGACCACATACrrGGATTTTATGATAGGAAAG^ 

TGCAGTTATTTCAGTATGCAAGAAGTTTGGTA I I 1 1 I GAAAAAGTCCATGGGTATTGCAA 
GCAAATATGCACATTTTGCTTTATGCCATT^^ 
AGGXL^TCCTCTGCTTCrGTCCACCCAAGCrCCTTCCrGAGACCT 
TTTCTGCACACTAALI I ILI I AGACATGMGAGAAAGCTGTCTACACAGTGTGGTGTAGT 
[T,G] 

TTCTTATGGGCTCrGGACCTATGGTGCTG I I 1 1 CTCTCCTCCTGCrGAAGGTCCATTCAT 
CCCTCGGGGCTCTCrAAMGrCACCrrCCrGTGACAAG^ 
AAGCCAGTTCCTCCCCTGTCCAGCCT^^ 
TCATTGGATGATGGAAMCCCATTGTTTTCCCAGTGG^TTGT 

TAGGCTGTATATATTCTCAMTTTCCCAGAGTATGTMCTAGGTCAL I I I I AGATTCAGA 



(SEQ ID NO: 106) 
49482 TCCATGGCTATTGCMGCAMT^^

CTTGGATACCACCMCAGGCATCCrCrGCT^ 

CTTTATAGTATTGrGATTTCTGCACACrAAL I I I LI I AGACATGAAGAGAAAGCTGTCTA 
CACAGTGTGGTGTAG I I I ICI I ATGGGCrCTGGACCTATGGTGCTG I I I ICTCTCCTCCT 
GCTGMGXJTCCATTCATCCCTCGGGGXT^ 



CTMG^TCrCMTCAMGCCAGrrCCTCCCCTGrCCAGCCTCCCrCC^GrGCrGMTTG 
CAGAATATCCCA I I I I I CATTGGATGATGGAAAACCCATTG I I I I CCCAGTGGATTGTAA 
ATTACTTCGGGGTAMTAGGCrGTATATATTCrCAMTTTCCCAGAGT ATGT AACTAGGT 
CACTTTTAGATTCAGATAGA I I I IGI I CCTTGAATAGCTAGTACTTTAGGAAACTAAGAA 
AMGATCTITTCMCCTGGTATGTAGCTCTGTCAMCACATC^TCAGT^ 



49741 CTCGXSGGCTCTCTAAAAGCCACCTTCCTCT 

GCCAGTTCCTCCCCrGTCCAGCCTCCCTCGAGTGCTGMTTGCAGAATATCCCAl I I I IC 

ATTGGATGATGGAAAACCCATTG I I I I CCCAGTGGATTGTAAATTACTTCGGGGTAAATA 

GGCTGTATATATTCTCAMTTTCCCAGAGTATGTMCrAGGTCACTTTTAG^ 

GAI I I IGI I CCTTGAATAGCTAGTACTTT AGGAAACTAAGAAAAAGATCTTTTCAACCTG 



TATGTAGCTCTGTCAMCACATCATCAGTATGGGGTAMCCTGTGriTCT 
CATTACC^TAGTAGTGTCATTGTATCATTGACAGTGTMTAGTGTGGG ICI IG 

TGGTTTCAGCTGCCACTCrGTACrGACTGXTTTCCACTCCA (SEQ ID NO: 108) 



49840 ATCrrTTCAACCTGGTATGTAGCTCTGT(^ 
TTCTCrGTCGGTTGTCATTACCATACT 



TAGTGTGGGGTAGTG I ICI IGTGGTTTCAGCTGCCACrCrGTACTGACrGCnTCCACTC 

CAACATCTTCCTCTTTATCTCAACACTCTAGCT 

CTCTGCTTGCATGACCCAGGAGTGCCrCC^ 

TTCTGCTACrCCCTGTCTCCTGACCCrGCTCCAGCAACACAGACAGACACCC^ 
TCrATATGTCATATGGTGGGGMTGCCCrrTAGTAOTACTCAGGAG^ 



50102 CATTACC^TAGTAGTGTCATTGTATCATTGACAGTGTAATAGTGTGGG^ ICI IG 
TGGTTTCAGCTGCCACrCTGTACT^CrecrTTCCACrCCM 
MCACTGTAGGTCTACCTGTGTACTGTGTGTTTCAGCATCTCTGCrrG 
GTGCCTCCCACTCAATATGGCCACCATG^ 
GACCCTGCTCCAGCAACACAGACAGACACCCTTCCT 



ATGCCCTTTAGTACTTACTCAGGAGTTAGTTCCTC 

TTTTGTTACAGCACTTTCAC^TTGMTTCrGACGTTCT 

ACTGTG^GCrrCCrrAGGCAGTAGCTACrrGTATTCrrAGCACCTTGCCC^ 

AACCCrTATTMGTAAATGAAMGACAGMCTGACAGA 

CCTCAATCTCAAGCCATTAAGATCAAGGG 



50109 ATAGTAGTGTCATTGTATCATTGACAGTGTMTAGTGTGGGGTAGTG I I C I I GTGGTTTC 
AGCrGCCACTCTGTACrGACrGCrrTCCACTCCMCATCTTCCT 
TAGGTCTACCTGTUTACTGTCrGTTTCAGCATC^ 
CCACTCMTATGGCCACCATGCATGGTCATCTTTCrGCrA 
CrCCAGCMCACAGACAGACACCCrrCCrCTTTCTATATGTCATA^ 



TTAGTACTTACTCAGGAGTTAGTTCCTCrGGGA^GCCTTCTGTTCrAGTTT 

ACAGCACTTTCACATTGMTTCTGAC^ 

GCTTCCTTAGGCAGTAGCrACTTGTATTCTTAG^ 

ATTMGTAMTGAAMGACAGMCTGACAGACrGGMTTAGAGCTCMGCrrGCCT 
CTCAAGCCATTAAGATCAAlGGGGAGCGGGGCOT 



50747 CCAGCCTGGGCMCGTGGCAAMCCCCATTTCrACAAAAMTATAAAM 
TGGGGGTGTGTGCCTGTACrCAGGATGCTGAGGTGGGAGGATCAC^ 
AGAGGTTGC^GTGAGCTGGGATCACACCATTGCWCTAGCCTGGCT 



(SEQ ID NO: 107) 



(SEQ ID NO: 109) 



(SEQ ID NO: 110) 



(SEQ ID NO: 111) 



[A,C] 
CTTGTCTCAAAAAAAAM^

(^GCAGCATGTGGACAGAAATGrAOK^O\AGMGG(^TCACrCACrGMGAGAa"GAA 



GTGGTTCACTGTXXiCTCMGACrGGTQGAGTGrGnrTTCCGGAM 

CTGGAG^GATAAACAGGGGCOW\TCT 

GGGGCTATTGTAGCATCTTATATAGGGMGTGAMTG^ 

TCMCCTGAAAAMGAGTGGAGACATTGrrGGGGGAGAGTGAGGTAGACrAGAGGCAGGG 
AGMTATTTAMTMTTGAGGTAAGAMTGATGMCACCAGTATMQGTGATCT 



51272 TAGACTAGAGGG\GGGAGMTATTTAAATAATTGAGGT AAGAAATGATGAACACCAGTAT 
MGGTGATGTCTTTMGGMTGGAGAAGGGAATGMCT^ 
TCMCACtMCTCACTGACTGACTGCtATA^ 
ATTCTMTTTCTMCTTCVVGTGACTGC^^ 
GTGCATGCTCAGTTTGACiATGTGTCC^CATOT 



TATAT(^GCrAGCCATTMGAGAGAGATCnTGATAGAGAGGI IGI I GCTGAGTTGAGCC 
ATTGGMTGGGCAGGATCACTCAAGMGAGCTTATAMTG^ 
TCCAMGGGAGMGTAAMGMGAMCTTGCAMQGACACTGAGMGAMTAGCT 
GATGQGAGAAMTCCAGAGAGAGGGATGGCATAGGAGT(^GTGGMGGAMCGGrTTCAT 
QGGGGTCAGrACrACrGGGTAGTGAATATAATAAGAATATCI I I I AQGATTTCTCAACCC 



52842 TCAGGGTGGTTTTGAGGGCTCAGTTMGTCTC^^ 

QGCMGTTAaTAMCTCTCrCTGACTATTACCrC^TCrcrMGATGGGGACrMGCrTG 
GTGACATAG I I I I ACATACCAGGCACAGTGCCTGAC I I I I I GGCTCrGTCCTGAAGTCTT 
CCCTTTGTATATGGTATGTTTCGGGGAATAGG^ 

TATCCTCCATCAGTCACTAAACGTTTACTCTGTAC. I I I IGATAGGTGCTOTGGGGCTCCA 



QGTATAAMGGTACCrrCAAAGTTACTGTTAMGTGCAGGAAQb I I I I IAAGCAAATTAT 
CTTT AA TGATTTTGACAATCTGACATGCAGGAAAATTAATAGQGCCTATC 
GTTTTATGrMCACrcrGTAGrrCAGGAMCAGAGCCCTTGGMGCAOT 
QGAQGAATGTCTQGTATTTQGGAATCTCATGAAATGATAATATACrTAA I I I I IATCATG 
AGCAGCAAMGVCAGATTTGCTAQGAGAAAGTCATCGTATG 1 1 G I I GCATTGGGCACTTT 



61837 GAGGAACCTCCATGTCATTTTCCATAGTAACTAGACC I I I I IGI I I 1 1 I AACATTTCTAT 
CMTCTACACCMGATTCCAATTTC^^^ 
GGTCTACTACTATTGCrGTGTTGCrGTTTATTCCTCCCTTC^ 

TCATATATTTAGGAGCrTAATATTAGGTCCATATGAAGTTATAA I I I <_ I ICCTGGTAAAG 
TGACCCATTTATCATTATGTMTGTCCAT^ 



TCTAT TTTGTC rGATGTMTTATGGCCACCCCrr^ I I I IATGG 

AATATL I I I I I CCATCOTTC^CXrTCAGCTTATGTGTGTCCTTAGATCTAAAGTGAGTC 
TGATAGATMGCTATAGTTGATTCrCTAT^ 
GTTAGGGGATTTMTCC ATTTAC ATTTAAAGCAGTTACTGA^ 

GTCATTTGGCTAGCTACC I I I I I ATCTTTGTCCTGTGGCTTTTCTG I I I I ICCCTTCCTC 



62018 CATATATTTAGGAGaTAATATTAGGTCCATATGAAGTTATAA I I I CI ICCTGGTAAAGT 
GACCCATrrATCATTATGTMTGTCCATCTTTGTCT 

TCTATTTTGTCTGATGTAATTATGGCG\CCCC I I 1 1 CrCTTTGGGTTCCCG I I I I IATGG 
AATATL I I 1 1 I CCATCCTTTCACTTTCAGCTTATCT 

TCATAGATMGGTATAGTTGATTCTGTATGTGTTATTCACTCAGCM I I IA 



TrAGGGGATTTAATCC ATTTAC ATTTAAAGQVGTTACrGAT^ 
TCATTTGGCTAGCTACCI I I I I ATCTTTGTCCTGTGGCTTTTCTG I I I I ICCCTTCCTCT 
CTTCCTGGCI ICI ICTGTGI I I IGI IGAI I I I I I I 1 1 I I I I I GTAGTGATATGTTCTGAT 
TCCCTTCTCATTTCCCTTTGTGTGCATTCTATAGATGCTA I I I I I GTGGTTACCATTGCA 
ACTACATAMC<ATACTAMGTTATAGCMCTTATT^ 



(SEQ ID NO: 112) 



(SEQ ID NO: 113) 



(SEQ ID NO: 114) 



(SEQ ID NO: 115) 



(SEQ ID NO: 116)



[G,A]
GACTGAMTTCAGACACATGCAGTCTGATT^

GMCTTTGCATGACTGATACGGCTGATAGATTGrcrATGGCTGA^ 

ACCTAAAAGTCTGATCATTTTACATCTGriTCAGACATCTTTG^ 

TCCAAAGI IGI I AGTGGGMTTTCAMGCCTTTAATAATCTAGCCCCAC I I IGI ICACTC 

TCTGTGTAATMCG^CATACMCMTra 

[A,G] 

TTGTU IGGI I CTGCCAGCAACACTQGrrTTTCGCrrrcrCTTCCrGL I ICjI igaggtcat 
TTCCAAGGCCCAGGTCTTTGTGL I I I I I CCCAAGCTTCCCAGAGL 1 1 CI ICCATACTCCC 
CTTACTTCCr(5AGATTTMCTGTrCTCTCTTCAGCGCTTGrcrACT 
AGCAGCACTGTGGGGTGGTGGAMGTGTACCAGCTTTGGAGTCAGACC^ 
CCCTACCATTTTCTACTTAGA I I I I I I IAGGACAMTTTCTCCATCTTTCTAAGCCTCCA 

TCTAGCCCCACI I IGI I CACTCTCTGTGTMTMCCACATACAACMTTGGCTGCATCTC 
CATAGCACATGGT ACrCCTCCCGTTGTCTTGGTTGTGCCAGCAACACI'GG I I I ICGCTTT 
CTCTTCCTGCI IGI I GAGGTCATTTCCAAGGCCCAGGTCTTTGTGC I I I I ICCCAAGCTT 
CCCAGAGCI I CI I CCATACTCCCCTTACTTCCTGAGATTTAA 
TTGTCTAGTMGAAGGAGGCAGCAGCAGCACTGTGGGGTGGTGGAAAGTCT 

[G,A] 

GAGTCAGACCATTGGATCT(^GCCCrACCATTTTCTACTTAGA I I 1 1 I I I AGGACAAATT 
TCTCCATCTTTCTMGCCTCCMTTGCTCACITACAAAATTGA^ 
MGATTGGTATGGAAGGTMTTMCC(^GTATTTAGAACATAGTMTTMTAMTMCTA 
TTATTACCATCATTACTATAGTTAGGACACTCACTGTTAGCT 

TAAAAGGGATGTTGTCTTGGGC I I CI I GGMTAMTGTTGTCCTTTTACTGTATTTTAGA 






(SEQ ID NO: 117) 



(SEQ ID NO: 118) 



66092 TTGGATCTCAGCCCTACCATTTTCTACTTAGA I I I I I I I AGGACAAATTTCTCCATCTTT 
CTAAGCCTCCMTTGCTCACrrACAAAATTGATATAACATT^ 
GGAAGGTAATTAACCCAGTATTTAGMCATAGTAA 

ATTACTATAGTTAGGACACTCACTGTTAGGTGCTATACAMGAGGATCATAAMG^ 
TTGTCTTGGGCI ICI I GGAATAAATGTTGTCC I I I I ACTGTATTTTAGAATATCATTCTG 
[G,A] 

GTCATAATTG I I IGI I GTCATAATMTGAMCATACTTGAATATTAAATTACCCTC 1 1 1 I 
TTTAI I I I I I AGCCATGTTAGMGGTTCCC(^CAGCTGMTATGGTTGGCCTCTTTCGAC 
GMTTATTTCCAMGMGGMTACCAGGACriTACAGAGGCATCACCCCAM 
AGGTGCTCCCTGCTGTAGGCATCAGTTATGTGG1TTATGAAMTATGM 
GAGTAACCCAGAAATGATGTTGCA 1 1 I I I I GCTTTAGCCTGATMTTXaAAACTTTCAACA 

66617 ATGMGCAAACTTTAGGAGTMCCCAGAMTGATGTTGCA I I I I I I GCTTTAGCCTGATA 
ATTGAMCTTTCAACAATCTCTGGAGTGAC I I I I I CTCCTCGAATTGAAACAAGTCTATG 
GCAAAAGAAGCTGCAI I I I I I I CACAAAAGGGMGATGGTMCMTGGTCACTTCAAACT 
TTTGGGCTAMTTATATGTACACAGAMTGTTCAAAATCATAG I I I I AATGTG I I I IGAA 
AAGGCCACACMTTATACTTTATC I 1 1 ICI I AATAATCCTGCAAATCTCTGCCCTGAATC 
[C,T] 

GAMTCTGAAMTGTACTGGCTTGAACAAAAI MUM I GTGTGTTAGAGTTATAAATCA 
TTMTCTTTATTrCGGGTGGTTTACGTTTATGCCAGTTCCTmTATTTAM I I ICI IGT 
TTTATATATTTTGAA I G I C I I IATAGAI I ICI I I AAATTTCCTTATAGAACCATTAATAG 
AAMTCATTACATTTAAAATATACC^ 

TGTCCTTAI I 1 1 ICI 1 1 CAGCTGMTACGMTGAGCACAGTGGTGGMTTTCTGAAGGGA 



(SEQ ID NO: 119) 



(SEQ ID NO: 120) 



66892 ATCCTGCAMTCTCTGCCCTGMTCCGAMTCTGAAMTGTACTGGCrTG^ 
GTTT 

TATAGAI I ICI I I 



rGTGTGTTAGAGTTATAMTCATTMTCTTTATTTCGGGTGGTTTACGTTTATGCC 
AGTTCCnTATATTTAAAl I ICI IGI I I I ATATATTTTGAATGl 



rcr 

AAATTTCCTT ATAGMCCATTAATAGAAAATCATTACATTT^^ 
MGCATCCAMTMGTATAGGGTTTATGTCCTTA I I I I ICI I I CAGCTGAATACGAATGA 
[G,A] 

CACAGTGGTGGMTTTCTGMGGGMGTGATGAMTTATATTTATTTC^ 
TCCATTTTACCACTGTACCATTATTTGGTTCCTGGAGTTO 
TACTGTTAMTTACCMCAO\AGGCMTTTATTTGAM 
CTTTGAAMGCAGCAGGAMCGAMTCCrrTGACTTGTATCAGCTTCT 
TGTTTTCCTrTGTCLI I IGI I I CCTACCTTTTGAATCAGATTCCG I I I IAGTCAGGAAGA (SEQ ID NO: 121)

CACTGTACCATTATTTGGTTCCTGGAGTTATACACrMTTT^ 
TTACCMCACMGXKIMTTTATTTG7W\G^ 

CAGCAGGAMCGAMTCGTTGACTTOTATCAGCTTCTGCAGAGCAT L I I I GI I I I COT 
TGTC L I I l b I I I C CTACCrrTTGMTCAGATTCCGTTTTAGTCAGGAAGA L I I C I I G GGA 
CCATTCTTAGTAACCrGAAA I I I LI I I I I I AATTGCATGAAGTGGATTGATCATGAGCAA 
[G,A] 

TGATGTGCrTATTTCTCCCTC7\CrcrrGAATAT 
C^GCACAMGGTGAGAGATACATATTMTAGTAGTATGTATTACTCm^ 
CCTATATTTAMTGAMQGCCCAATTTGTAMCATATACATTCATATTCTCr 
AACTTTTAGGMCATCTTAGGATATAG I I I I I I 

TATTTTACTAAAGCCA I I I I I ATAGTCAACTATL I I I I <_ I I ATTTGTGTGATTAGAACTT (SEQ ID NO: 122) 

ATAGTAGTATGTAmCTaTATACATTAGATACCTATATTTAMTGAMGK 
GTAMCATATACATTCATATTCTCTCTTGCC^^ 

GAGACTTAATTTATAATAATGAGAGCA I I I I I I I A TTTTACTAAAGCC A I I I I I A TAGTC 
AACTATCI I I I (J I ATTTGTGTGATTAGAACTTAGAAAMTATTTACTAGTTGAAGTTAT 
TATCAGI I I I I AATTTAG I I LI I AMCTCATTTCACTTCTMTMTTTCTGTTATAMTT 
[G,T] 

CCAGCATTTT AATGAAAATCT AATGATGTAATAGGCA I I I ILI I I ATTTGAACCTACCTC 
TTTTAI I I I L I GMCCAMGAGAAAGATGGACrGGTGTTTGTGAAACA I I 1 1 IAAAAATG 
TAGTTTCATTTATATTAGTTATGTTTGATAAATGTCTCAGTA I I I I I ATAATATGATAAG 
CCTGGGATTCTACTTTTAGGGTTATTTGTACriTTGA 

AAGGTACATGATCAGCTCTTTCTA I I I I I ACTCGTAAAMTTATGGAAATGAATAATTTT (SEQ ID NO: 123) 

ATTTCTGTTATAAATTGCCAGCATTTTAATGAAMTCTAATGATG^ 
TTATTTGAACCTACCTCTTTTATTTTCrGMCCAMGAGAM 

AAACA I I I I I AAAMTGTAGTTTCATTTATATTAGTTATGTTTGATAMTGTCT^GTAT 
TTTTATMTATGATMGCCTGGGATTCTACTTTTAGGGTTATTTGr^ I I I IGAGTAATA 
TATAMGTGACAATATTMGGTAC^TGATCAGCrcrTTCTA I I I I I ACTCGTAAAAATTA 
[C,T] 

GGAMTGAATMTTTTGCTMCAACTTTGAMTTTCAMC^ 

TTCATTGTTCATTATGMTTTAMTTGTMGGTATGAATGTGATTTGTCT 

TATLI I I I CCAAAAMTGATTCTGTATCTTTTGGAAAAAAGCCGAG^GTTGMGATAGTA 

TATTTCTGGTAGTACTXjMTATTTACrTACAGTT^ I I IGI I ICT 

AAAATTALI IGI I I ICCAGI I I I IAI I I I 1 1 I I AGAGAAMTTCTTAAGTCTCAGTTTCC (SEQ ID NO: 124) 

TTCAGAMTMCTTATCAGTTATTTCTGTAAGL I ILI I GCrrACCTGGATACCPGACAGG 

TGAGATGGCTGTAG<IAGACACrGGCAGTTCCCTGCCG\CACACCTGTC 

TGCACMGGCAGCTCTGTXrTGCAATTGCCAGCATCrGCrCCT 

TGTTAGAAAAATGCTGCCATA I I IGI I I CTCACCT ATTAGTCTTGTCTCCCAGTCAAGAG 

MTAMTTTATGCMGCAGAGATTGTACTTrACAGTATTTTGTCTITG^ 

[T,G] 

GTTGCATTTUrAAAMTGTCGCATC 

I I I IGI I CrCATGGAACTTCL I I I I I IGAAMGAGCACCAAAGGAGTAAAAATACTGTGG 
AGGGAGCMCCCTCCTTTGCCATATGCTCTCATTGX^GAW 
CATTTAGGXiCACrCTCTTGGGAGAGCACATCCrATGATGTTCT 

ACTGTGCT(^ACTCCAAGCTGACG\GCrTTCTGA (SEQ ID NO: 125) 

CTGTGTGCMTTGXICAGC^TCTGX^CCrcrGTTCT^GGGAATL I I IGI IAGAAAAATGC 
TGCCATAI I IGI I I CTCACCrATTAGTCrrGTCrCCCAGTCAAGAGAATAMTTTATGOV 
AGCAGAGATTGTACTTTACAGTATTTTGTCTTTGAGaTGGCA™ 
AMTGTGGXATGGCTTCCrCATCCCCCMTAGGMCTTTGCCAGCCL I I I IGI ICTCATG 
GAACTTCLI I I I I I GAAMGAGX^CCAMGGACTAAAMTACTGTGGAGGGAGC^ACCCT 
[C,T] 

CrrTGCCATATGXTCTCATTGQjAGACATGTXXaAO 

TCTGGGAGAGCACATCCTATG7\TGTTCrCCCAGCCrAGXlCCCTTCCACTGTGCT 
CMGCTGACCAGCTT^^
TCCTATACCCAGA (SEQ ID NO: 126)